NILPROGRESSIONS AND GROUPS WITH MODERATE GROWTH


EMMANUEL BREUILLARD AND MATTHEW C. H. TOINTON 


Abstract. We show that doubling at some large scale in a Cayley graph implies uniform doubling at 
all subsequent scales. The proof is based on the structure theorem for approximate subgroups proved by 
Green, Tao and the first author. We also give a number of applications to the geometry and spectrum 
of finite Cayley graphs. For example, we show that a finite group has moderate growth in the sense of
Diaconis and Saloff-Coste if and only if its diameter is larger than a fixed power of the cardinality of the
group. We call such groups almost flat and show that they have a subgroup of bounded index admitting
a cyclic quotient of comparable diameter. We also give bounds on the Cheeger constant, first eigenvalue
of the Laplacian, and mixing time. This can be seen as a finite-group version of Gromov’s theorem on
groups with polynomial growth. It also improves on a result of Lackenby regarding property ($\tau$) in towers
of coverings. Another consequence is a universal upper bound on the diameter of all finite simple groups,
independent of the CFSG.

1. Introduction

Let G be a group generated by a finite, symmetric subset S. Here, and throughout this paper, by
writing that S is symmetric we mean that if s belongs to S then so does $s^{-1}$, and that S contains the
identity. When speaking of the growth of G with respect to S, we refer to the behaviour of the sequence
of cardinalities

$|S|, |S^2|, |S^3|, \ldots,$

where we denote by $S^n$ the n-fold product set $\{s_1 \cdots s_n : s_i \in S\}$. This is the ball of radius n in the
Cayley graph of G relative to S. In the event that G is finite, we also consider the diameter $\mathrm{diam}_S(G)$ of
G with respect to S, which is defined to be the minimum n such that $S^n = G$.
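As an aside for readers who wish to experiment, the following Python sketch computes the cardinalities $|S^n|$ and the diameter by breadth-first search in the Cayley graph. It is an illustration only and plays no role in the arguments of the paper; the function names `ball_sizes` and `diameter`, and the example of the cyclic group Z/12 with S = {0, 1, -1}, are assumptions made for the purpose of the sketch.

```python
def ball_sizes(gens, identity, mul, max_radius):
    """Return [|S^0|, |S^1|, ..., |S^max_radius|] for a symmetric set S containing the identity."""
    ball = {identity}
    frontier = {identity}          # elements of S^n that are not in S^(n-1)
    sizes = [1]
    for _ in range(max_radius):
        # New elements of S^(n+1) can only arise from the frontier of S^n.
        frontier = {mul(g, s) for g in frontier for s in gens} - ball
        ball |= frontier
        sizes.append(len(ball))
    return sizes


def diameter(gens, identity, mul, group_order):
    """Return the least n with S^n = G, for a finite group G of the given order."""
    ball = {identity}
    frontier = {identity}
    n = 0
    while len(ball) < group_order:
        frontier = {mul(g, s) for g in frontier for s in gens} - ball
        if not frontier:
            raise ValueError("S does not generate a group of the stated order")
        ball |= frontier
        n += 1
    return n


# Hypothetical example: G = Z/12 (written additively) with S = {0, 1, -1}.
N = 12
add_mod = lambda g, s: (g + s) % N
S = [0, 1, N - 1]
sizes = ball_sizes(S, 0, add_mod, 6)
gamma = diameter(S, 0, add_mod, N)
print(sizes, gamma)
```

The group is passed in through its multiplication map, so the same two functions apply to any finite group whose elements are hashable.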

According to Gromov’s polynomial growth theorem [23], if the sequence $\{|S^n|\}$ is bounded above by a
polynomial function of n, then G has a finite-index nilpotent subgroup. In this paper we are concerned
with some refinements of this theorem, mostly in the context of finite groups. In particular, we study
the relations between the diameter and the cardinality of a finite group on the one hand, and various
interesting invariants such as the Cheeger constant, the first non-zero eigenvalue of the Laplace operator,
and the mixing time of the associated Cayley graph on the other.

Key words and phrases. polynomial growth, volume doubling, diameter bounds, finite groups, approximate groups,
moderate growth, mixing time.

Our two main results are Theorems 1.1 and 1.3 below. The first concerns the doubling property at one
given scale in an arbitrary Cayley graph.

Theorem 1.1 (Doubling at some scale implies doubling at all larger scales). For every $K \ge 1$ there exist
$n_0 = n_0(K) \in \mathbb{N}$ and $\theta(K) \ge 1$, such that if S is a finite symmetric set inside some group, and if there
exists $n \ge n_0$ for which

$|S^{2n+1}| \le K|S^n|$, (1.1)

then for every $m \ge n$ and every $c \in \mathbb{N}$ we have $|S^{cm}| \le \theta(K)^c|S^m|$.

We stress that the constants $n_0(K)$, $\theta(K)$ depend only on K, and not on the group or on S. Most
of our results in this paper are uniform in the generating set S and do not assume that S is bounded.
Unfortunately the method does not allow for an effective computation of the constants $n_0(K)$ and $\theta(K)$
in terms of K.
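To make the doubling hypothesis concrete, here is a small Python sketch that computes the ratio $|S^{2n+1}|/|S^n|$ at several scales in one explicit example. The example, the group $\mathbb{Z}^2$ with S consisting of the origin and the four unit vectors, and the function names are assumptions chosen for illustration; in this example $|S^n| = 2n^2 + 2n + 1$, so the ratio is at most 5 at every scale and tends to 4.

```python
def ball_sizes_Z2(max_radius):
    """Cardinalities |S^n|, 0 <= n <= max_radius, for S = {0, +-e1, +-e2} in Z^2."""
    gens = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]
    ball = {(0, 0)}
    frontier = {(0, 0)}
    sizes = [1]
    for _ in range(max_radius):
        frontier = {(x + a, y + b) for (x, y) in frontier for (a, b) in gens} - ball
        ball |= frontier
        sizes.append(len(ball))
    return sizes


def doubling_ratios(sizes):
    """Map n -> |S^(2n+1)| / |S^n| for every n >= 1 with 2n+1 within range."""
    largest_n = (len(sizes) - 2) // 2
    return {n: sizes[2 * n + 1] / sizes[n] for n in range(1, largest_n + 1)}


sizes = ball_sizes_Z2(21)
ratios = doubling_ratios(sizes)
for n in (1, 2, 5, 10):
    print(n, round(ratios[n], 3))
```

Of course the content of Theorem 1.1 lies in the uniformity over all groups and all generating sets, which no finite computation of this kind can capture.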

The doubling property played an important role in Gromov’s original proof of his polynomial-growth
theorem. Indeed, the first step of his proof consisted of observing that the polynomial growth condition
on $|S^n|$ implies that a doubling condition such as (1.1) holds for infinitely many n and some uniform K.
Green, Tao and the first author established in [10] a structure theorem for doubling sets (or equivalently
approximate subgroups; see Section 2) in arbitrary groups, allowing them to extend Gromov’s theorem in
a number of ways. The proof of Theorem 1.1 makes crucial use of this structure theorem via an analysis of
the behaviour of large powers of nilprogressions (see Section 3 below, for the definition of nilprogressions).

Remark. Tao m has independently used a somewhat similar argument to classify fairly explicitly the
possible behaviours of $|S^m|$ as $m \to \infty$ under the hypothesis (1.1), in particular recovering Theorem 1.1.


The motivation for Theorem 1.1 came from work of Benjamini, Finucane and Tessera [1], in which the
authors identify the scaling limit of a sequence of vertex-transitive graphs of large diameter (i.e. such that
(1.2) below holds). Using Theorem 1.1 they show¹ that for a given $\varepsilon > 0$, the family of all such Cayley
graphs (rescaled so that edges have length $1/\mathrm{diam}_S(G)$) is relatively compact for the Gromov-Hausdorff
topology on the space of compact metric spaces. Indeed a doubling condition such as that of Theorem
1.1 is exactly what is needed to apply Gromov’s relative compactness criterion [24]. Furthermore, they
identify the possible limits as the flat tori of dimension $d = d(\varepsilon)$, endowed with a translation invariant Finsler
metric.

This prompts the following definition. 

Definition 1.2. Let $\varepsilon > 0$. A Cayley graph of a finite group G with symmetric generating set S is called
$\varepsilon$-almost flat if

$\mathrm{diam}_S(G) \ge (|G|/|S|)^{\varepsilon}$. (1.2)
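The following Python sketch illustrates the almost flat condition on two familiar families. It takes condition (1.2) to read $\mathrm{diam}_S(G) \ge (|G|/|S|)^{\varepsilon}$, which is an assumption of the sketch, as are the function names and the two examples (a cycle and a hypercube, whose diameters are entered by hand rather than computed).

```python
import math


def flatness_exponent(diam, group_order, gen_size):
    """Largest eps with diam >= (|G|/|S|)**eps, assuming |G| > |S| and diam >= 1."""
    return math.log(diam) / math.log(group_order / gen_size)


def is_almost_flat(eps, diam, group_order, gen_size):
    """Check the (assumed) condition diam_S(G) >= (|G|/|S|)**eps."""
    return diam >= (group_order / gen_size) ** eps


# Cyclic group Z/N with S = {0, 1, -1}: the diameter is floor(N/2).
N = 1000
cyclic = flatness_exponent(N // 2, N, 3)

# Hypercube (Z/2)^k with S = {0, e_1, ..., e_k}: the diameter is k.
k = 20
cube = flatness_exponent(k, 2 ** k, k + 1)

print(round(cyclic, 3), round(cube, 3))
```

For the cycle the exponent stays above 1 as N grows, whereas for the hypercube it tends to zero as k grows, so no fixed $\varepsilon > 0$ works for the whole hypercube family.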


¹Their original argument required only the main result of [10], not Theorem 1.1, but needed to assume a bound on $|S|$.

See [4] and the survey [7] for more details on the result of Benjamini, Finucane and Tessera. We will
give further applications of Theorem 1.1 below.

Remark. Abelian groups and more generally nilpotent groups with a bound on the nilpotency class and
number of generators are $\varepsilon$-almost flat for some positive $\varepsilon$ depending only on these bounds. See Proposition
sa below. We also show in Proposition 4.15 that the notion of an almost flat group is independent of
the choice of generating set, provided the size of the generating set remains bounded.

Our second main result concerns the algebraic structure of almost flat finite groups, and runs as follows. 

Theorem 1.3 (Almost flat groups have large virtually abelian quotients). Let $\varepsilon > 0$ and $\beta$ > ^. Every
$\varepsilon$-almost flat Cayley graph of a finite group G with generating set S has a normal subgroup H contained
in such that G/H has an abelian subgroup whose rank and index are bounded in terms of $\varepsilon$ and
$\beta$ only.

The implied constants depend only on $\varepsilon$ and $\beta$, and not on the group G or the size of the generating
set S. We refer the reader to Theorem 4.1 below for two variants of Theorem 1.3. In particular, we show
there that G must have a subgroup of bounded index with a cyclic quotient of comparable diameter.

We note that Gromov had already proved a finite version of his polynomial growth theorem (see at the 
end of [23]). The above can be seen as a refinement, in which a weaker assumption is made (a diameter 
condition instead of a volume growth condition) and a somewhat stronger conclusion is derived. 

It is legitimate to ask whether the conclusion of Theorem 1.3 (or those of Theorem 4.1 below) continues
to hold under a weaker assumption on the diameter. The answer is no. In fact, given any function
$f : \mathbb{N} \to \mathbb{R}_+$ going to zero at infinity, there is a sequence of pairwise non-isomorphic finite groups $G_n$ with
uniformly bounded generating sets with respect to which their diameters are at least while 

Gn has no proper subgroup of index at most n and no non-trivial abelian quotient. This example also 
provides a counter-example to a non-commutative version of the polynomial Freiman-Ruzsa conjecture. 
See Subsection [Q below. 

We now present some applications of Theorems 1.1 and 1.3.

1.1. Diameter of finite simple groups. The following is an immediate consequence of Theorem 1.3.

Corollary 1.4 (Diameter bound for finite simple groups). Let $\varepsilon > 0$. Then there is a constant $C_\varepsilon$
depending only on $\varepsilon$ such that every non-abelian finite simple group G with symmetric generating set S
satisfies

$\mathrm{diam}_S(G) \le \max\{(|G|/|S|)^{\varepsilon}, C_\varepsilon\}$.

Proof. Write $\gamma = \mathrm{diam}_S(G)$. Since G is simple, if $\gamma > (|G|/|S|)^{\varepsilon}$ then Theorem 1.3 implies either that
7 < 05 ( 7 ^/'^), in which case $\gamma \le O_\varepsilon(1)$, or that G has a normal abelian subgroup of index at most
$O_\varepsilon(1)$. This normal subgroup is either all of G, in which case G is abelian, or it is trivial, in which case
$|G| \le O_\varepsilon(1)$, and so certainly $\gamma \le O_\varepsilon(1)$. □

This bound should be compared with the conjecture of Babai (see m) according to which there ought
to exist universal constants C, D > 0 such that $\mathrm{diam}_S(G) \le D(\log|G|)^C$ for all finite simple groups G
and all S. Such a strong bound is currently only known for finite simple groups of bounded rank (see
[H [27] 02] ). In fact for these groups it is even conjectured that one can take C = 1 with a multiplicative
constant D that is allowed to depend on the rank [Q Conjecture 4.5]. For alternating groups $A_n$, weaker
bounds that are still much better than that of Corollary 1.4 are known (see [28]).

Corollary 1.4 is the first non-trivial bound towards Babai’s conjecture valid for all finite simple groups
as far as we can tell. However, even if it is not straightforward, it is likely that a better bound also valid
for all finite simple groups is within reach by current methods using the classification of finite simple
groups and the above mentioned results. Let us note however that a remarkable feature of the proof of
Corollary 1.4 is that it does not use the classification of finite simple groups.


1.2. Cheeger constant and spectral gap. It is well known that the diameter of a Cayley graph is
related to its Cheeger constant h and to the first eigenvalue $\lambda_1$ of the Laplace operator in such a way that


8(diam5(G))2 


< -h^ < Ai < 2\S\h < 


8|5plog|G| 

diam5(G) 


(1.3) 


See Section 5 for a definition of these quantities and a proof of (1.3).


Corollary 1.5 (Groups with very small Cheeger constant or spectral gap). Let G be a finite group with
finite symmetric generating set S. Let $\varepsilon > 0$. Assume that the Cayley graph of G with respect to S
satisfies one of the inequalities

A,<min{2A(||) }, /. < mi„ { 2 A (||) }■ 

Then G is $\varepsilon/3$-almost flat. In particular, the conclusion of Theorem 1.3 holds.


This is useful in a variety of contexts, for example in the work of Ellenberg, Hall and Kowalski [TS] in
arithmetic geometry, where they used a weak lower bound on $\lambda_1$ for certain perfect groups due to Pyber
and Szabo [12]. The bound from Corollary 1.5 would work there equally well. Note that, conversely, it is
easy to verify that if a finite group G has a subgroup of bounded index with a cyclic quotient of comparable
diameter then its $\lambda_1$ is very small, indeed of order $O_{|S|}(1)/(\mathrm{diam}_S(G))^2$.


Corollary 1.6 (Cheeger constant and spectral gap for almost flat groups). Given $\varepsilon > 0$ there are $C_1, C_2 > 0$
such that if G is a finite group with $\varepsilon$-almost flat Cayley graph, then its first eigenvalue $\lambda_1$ and Cheeger
constant h satisfy


8(diam5(G'))2 


< -h^ < Ai < Ci|5|/i2 < 


C 2 \S\‘^ 

(diams(G))2' 


(1.4) 


In fact, there is $C_3 > 0$ such that

$\lambda_1 \le \dfrac{C_3|S|}{(\mathrm{diam}_S(G))^2}$. (1.5)


In other words the Buser inequality holds in its strong form on almost flat groups, which is to say
that $\lambda_1$ is comparable to $h^2$. This follows from a simple argument using the doubling property at a scale
comparable to the diameter. See Lemma 5.2. It is known by work of Klartag et al. [30] that for abelian
G we may take $C_1 = 16$, which is essentially sharp.

1.3. Property ($\tau$) and towers of coverings. A finitely generated group $\Gamma$ is said to have property ($\tau$)
with respect to a sequence of finite-index normal subgroups $\{\Gamma_n\}_n$ if the Cheeger constant $h(\Gamma/\Gamma_n)$ of the
Cayley graph of the finite group $\Gamma/\Gamma_n$ with respect to a fixed generating set of $\Gamma$ is bounded away from
zero, independently of n. If $\Gamma$ has property ($\tau$) with respect to the family of all its finite-index normal
subgroups then one simply says that $\Gamma$ has property ($\tau$). This property is independent of the choice of
generating set and stable under passing to a finite-index subgroup. Property ($\tau$) was introduced and
studied by Lubotzky; see [HlEZllsH] for background.

It is clear that a finitely generated group that admits a finite-index subgroup with infinite abelianisation
cannot have property ($\tau$), as it will have large virtually cyclic quotients. The converse is not true, because
of the existence of amenable finitely generated infinite torsion groups, such as the Grigorchuk group (see
e.g. [38]). However, it is not known whether or not the converse holds in the category of finitely presented
groups.

In connection with Thurston’s virtual first Betti number conjecture regarding the topology of 3-manifolds,
Lackenby [31, 32, 33] studied various extra conditions of an algebraic or topological nature on
a finitely presented group without property ($\tau$) that force the existence of a finite-index subgroup with
infinite abelianisation. See also [34]. In particular, he showed that if the Cheeger constant $h(\Gamma/\Gamma_n)$ decays
faster than the square root of $|\Gamma/\Gamma_n|$, then $\Gamma$ has a finite-index subgroup with infinite abelianisation [33,
Theorem 1.1]. As a consequence of Theorem 1.3 we can remove the finite presentability assumption and
strengthen Lackenby’s result as follows.

Corollary 1.7 (Strong failure of ($\tau$) implies positive virtual Betti number - group form). Let $\Gamma$ be a finitely
generated group, and $\{\Gamma_n\}_n$ an infinite family of finite-index normal subgroups such that $h(\Gamma/\Gamma_n)[\Gamma : \Gamma_n]^{\varepsilon}$
is bounded independently of n, for some fixed $\varepsilon > 0$. Then $\Gamma$ has a finite-index subgroup with a surjective
homomorphism onto $\mathbb{Z}$.

Using the well-known transfer principle between spectral properties of finite covers of Riemannian
manifolds and those of the Cayley graphs of the associated groups of deck transformations (see [12], [39],
[5, Appendix]), one has the following geometric consequence of Corollary 1.7.

Corollary 1.8 (Strong failure of ($\tau$) implies positive virtual Betti number - geometric form). Let M be a
compact Riemannian manifold and $M_n$ a sequence of finite Galois covers of M of degree $d_n$. Assume that
there is $\varepsilon > 0$ such that $h(M_n)(d_n)^{\varepsilon}$ is a bounded sequence, where $h(M_n)$ denotes the Cheeger constant of
$M_n$. Then M has a finite cover with positive first Betti number.

Note that we could have stated these corollaries with $\lambda_1$ in place of h, because of the Cheeger-Buser
inequalities (1.3). We take this opportunity to raise the following question.

Question: Can the assumption that the covers are Galois in Corollary 1.8, or the assumption that $\Gamma_n$
is normal in Corollary 1.7, be removed? Suppose even that the (not necessarily normal) finite-index
subgroups $\Gamma_n$ are such that the diameter of the Schreier graph of $\Gamma/\Gamma_n$ is at least $[\Gamma : \Gamma_n]^{\varepsilon}$, for some fixed
$\varepsilon > 0$. Does this imply that $\Gamma$ virtually maps onto $\mathbb{Z}$?

1.4. Groups with moderate growth and mixing times. Given positive constants A and d, Diaconis
and Saloff-Coste m say that the finite group G has (A, d)-moderate growth with respect to a finite
generating set S if, writing $\gamma := \mathrm{diam}_S(G)$ for the diameter of G with respect to S, we have

$|S^n| \ge \dfrac{1}{A}\left(\dfrac{n}{\gamma}\right)^d |G|$ (1.6)

whenever $1 \le n \le \gamma$. It is clear from the definition, taking n = 1, that (G, S) is ^-almost flat, provided
$|G|/|S|$ is large enough (in fact this holds as soon as the diameter $\mathrm{diam}_S(G)$ is larger than some constant
depending only on A and d). In particular, by Theorem 1.3 such a group has a large virtually cyclic
quotient. Using Theorem 1.1 we can prove that, conversely, every $\varepsilon$-almost flat group is of moderate
growth.
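As an illustration of the moderate growth condition, the following Python sketch tests it on a cycle. It assumes the condition in the form $|S^n| \ge A^{-1}(n/\gamma)^d|G|$ for $1 \le n \le \gamma$; this form, the function names, and the example Z/N with S = {0, 1, -1} (for which $|S^n| = \min(2n+1, N)$ and $\gamma = \lfloor N/2 \rfloor$) are assumptions of the sketch rather than statements taken from the text.

```python
def cyclic_ball_sizes(N):
    """|S^n| for 0 <= n <= floor(N/2) in Z/N with S = {0, 1, -1} (requires N >= 3)."""
    return [min(2 * n + 1, N) for n in range(N // 2 + 1)]


def has_moderate_growth(sizes, group_order, A, d):
    """Check |S^n| >= (1/A) * (n/gamma)**d * |G| for 1 <= n <= gamma = len(sizes) - 1."""
    gamma = len(sizes) - 1
    return all(
        sizes[n] >= (n / gamma) ** d * group_order / A
        for n in range(1, gamma + 1)
    )


print(has_moderate_growth(cyclic_ball_sizes(101), 101, A=2, d=1))
print(has_moderate_growth(cyclic_ball_sizes(101), 101, A=2, d=0.5))
```

In this example the pair (A, d) = (2, 1) works for every N at least 3, whereas lowering d to 1/2 with the same A already fails at n = 1 when N = 101.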

Corollary 1.9 (Groups with moderate growth versus almost flat groups). A group has moderate growth
if and only if it is almost flat. More precisely, given A, d > 0 there is C(A, d) > 0 such that every finite
group with (A, d)-moderate growth and diameter at least C(A, d) is ^-almost flat. Conversely, given $\varepsilon > 0$
there are A, d > 0 depending only on $\varepsilon$, such that every $\varepsilon$-almost flat group has (A, d)-moderate growth.

The observation is that the almost flat condition implies the volume doubling condition at some relatively
small scale (at most (diam 5 (G))°^^^; see Lemma IT^ below), and hence at all subsequent scales
by Theorem 1.1. Thus almost flat groups are doubling at all scales larger than (diam 5 (G))°^^^. This is
enough to guarantee moderate growth.

Groups with moderate growth provide examples of Cayley graphs for which the mixing time of the
simple random walk behaves quadratically in the diameter. This is the main result proved by Diaconis
and Saloff-Coste in [15], which now translates as follows. We write $\mu_G$ for the uniform probability measure
on G, $\mu_S$ for the uniform probability measure on S, and $\mu_S^{(n)}$ for the n-th convolution power of $\mu_S$.

Corollary 1.10 (Mixing time). The relaxation time and the mixing times of almost flat Cayley graphs
are comparable to $(\mathrm{diam}_S(G))^2$. More precisely, given $\varepsilon > 0$, there are positive constants B, C, D > 0,
depending on $\varepsilon$ only, such that whenever G is a finite group whose Cayley graph (G, S) is $\varepsilon$-almost flat
and satisfies $\gamma := \mathrm{diam}_S(G) \ge D$, for all $n \ge 1$ and all $p \in [1, \infty]$ we have

"I _/-T n ( \ _ n 

2 ® ^\\TG\\p<\\dP - TG\\p<Be 

where $\|\cdot\|_p$ denotes the $L^p$-norm on G. Indeed, the upper bound holds even without the assumption that
$\gamma \ge D$.

We stress that the bounds for C and B obtained by Diaconis and Saloff-Coste are explicit when
expressed in terms of the moderate growth parameters A and d. In the above reformulation in terms
of $\varepsilon$-almost flatness, we lose the explicit dependence, because the moderate growth parameters A and d
given by the second part of Corollary 1.9 depend on $\varepsilon$ in a non-explicit manner (due to our reliance on
the non-explicit Theorem 2.3 below).

The mixing times (for each norm) are at least half the diameter, but for a general Cayley graph of
bounded degree, they are always in $O(\gamma^3)$, where $\gamma$ is the diameter, as easily follows from the lower bound
on $\lambda_1$ from (1.3). This bound is attained for example for the uniform mixing time on the lamplighter
group with cyclic base (e.g. see m). At the opposite extreme, if the Cayley graph is an expander
graph (uniform lower bound on $\lambda_1$), then the mixing times are comparable to the diameter and a cut-off
phenomenon is expected (and proved in some cases, see for that matter [36]). For almost flat Cayley
graphs, no cut-off phenomenon arises.

We conclude with the following refinement of the Diaconis-Saloff-Coste result showing that doubling 
at one small enough scale is sufficient to guarantee quadratic mixing time. 

Theorem 1.11 (Quadratic mixing under a doubling condition). Given $K \ge 1$, there is $n_1(K)$ such that
the following holds. Consider the Cayley graph $\mathcal{G}$ of a finite group G with generating set S and diameter
$\gamma$. Assume that < K\S'^\ for some n > ni{K) with n < 7 ^/^. Then the mixing times of the simple
random walk on $\mathcal{G}$ are comparable to $\gamma^2$ up to multiplicative constants depending on K and $|S|$ only.

We provide an example showing that the exponent | is sharp; see Example 6.5.


The paper is organised as follows. In Section 2, we discuss approximate groups and recall the main
result of [10], which describes the structure of approximate subgroups of arbitrary groups. We also prove
Theorem 1.1 modulo a result on nilprogressions proved in Section 3. Section 3 is devoted to nilprogressions.
We give a summary of several competing definitions and explain the relationship between them. Then we
prove that their powers have uniform doubling, thus completing the proof of Theorem 1.1. Section 4 is
devoted to Theorem 1.3 and several extra structural results on almost flat groups. We also describe there
an example pertaining to a non-commutative version of the Freiman-Ruzsa conjecture and showing that
super-polynomial bounds are required in m. In Section 5 we discuss bounds on the Cheeger constant
and $\lambda_1$ for almost flat Cayley graphs and prove Corollaries 1.7 and 1.8 regarding property ($\tau$). Finally
Section 6 is devoted to random walks and mixing times. We establish Corollary 1.10 and Theorem 1.11.

Acknowledgements. The authors are grateful to David Simmons, Romain Tessera and Tianyi Zheng
for helpful conversations. We are also grateful to Persi Diaconis and Terry Tao for their comments on the
paper. Both authors are supported by ERC grant GA617129 ‘GeTeMo’. M.T. is on leave from a Junior
Research Fellowship at Homerton College, Cambridge, where some of this work was carried out.

2. Approximate groups 

In recent years there has been much effort to describe the structure of certain finite subsets of groups 
called approximate subgroups. The reader may consult the surveys [m 1201111 ] for details on the background 
to this field and its applications. The definition of an approximate group is originally due to Tao [15] . 
and runs as follows. 

Definition 2.1 (Approximate group). Let G be a group and let $K \ge 1$. A finite subset $A \subset G$ is said
to be a K-approximate subgroup of G, or just a K-approximate group, if it is symmetric and if there is a
subset X with $|X| \le K$ such that $A^2 \subset XA$.

Here, and throughout the paper, for subsets X and Y of a group we denote by XY the set $\{xy : x \in X, y \in Y\}$
and by $X^{-1}$ the set $\{x^{-1} : x \in X\}$.

It may not be immediately obvious that approximate groups should have anything to do with growth of 
groups. However, the following simple argument gives a reason to expect that the two should be related, 
at least on some level. 

Lemma 2.2. Let G be a group, and let $S \subset G$ be a finite symmetric set. Suppose that

$|S^{5n}| \le K|S^n|$, (2.1)

for some integer n. Then $S^{2n}$ is a K-approximate subgroup of G.

Proof. See Ruzsa’s covering lemma [48]. Upon replacing S by $S^n$ we may assume that n = 1. Let X be
a subset of $S^4$ that is maximal with respect to the condition that the sets xS with $x \in X$ are disjoint,
and note that this condition implies that $|XS| = |X||S|$. Given (2.1), and since $XS \subset S^5$, this implies in
particular that $|X| \le K$. It is therefore sufficient to show that $S^4 \subset XS^2$. This holds, because if $z \in S^4$
then, by the maximality of X, there are $x \in X$ and $s_1, s_2 \in S$ such that $zs_1 = xs_2$. The symmetry of S
then implies that $z = xs_2s_1^{-1} \in XS^2$, as claimed. □
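The covering argument in the proof above can be checked by machine in a small example. The following Python sketch treats the case n = 1: it picks greedily a subset X of $S^4$ that is maximal for the property that the translates xS are pairwise disjoint, and then verifies that $|X||S| \le |S^5|$ and $S^4 \subset XS^2$. The choice of the integer Heisenberg group, the generating set, and all function names are assumptions made for the illustration.

```python
def mul(g, h):
    """Product in the integer Heisenberg group, elements written as triples (a, b, c)."""
    return (g[0] + h[0], g[1] + h[1], g[2] + h[2] + g[0] * h[1])


IDENTITY = (0, 0, 0)
# Symmetric generating set containing the identity (x, y and their inverses).
S = {IDENTITY, (1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0)}


def product_set(A, B):
    """The product set AB = {ab : a in A, b in B}."""
    return {mul(a, b) for a in A for b in B}


def power(A, n):
    """The n-fold product set A^n (with A^0 = {identity})."""
    result = {IDENTITY}
    for _ in range(n):
        result = product_set(result, A)
    return result


def maximal_disjoint_translates(candidates, A):
    """Greedily pick X inside `candidates` with the translates xA pairwise disjoint."""
    X, covered = [], set()
    for z in sorted(candidates):
        translate = {mul(z, a) for a in A}
        if covered.isdisjoint(translate):
            X.append(z)
            covered |= translate
    return X


S2, S4, S5 = power(S, 2), power(S, 4), power(S, 5)
X = maximal_disjoint_translates(S4, S)
print(len(S), len(S4), len(S5), len(X))
```

The greedy choice produces a maximal family because an element that is rejected stays rejected as more translates are added; any other maximal family would serve equally well.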






Recently, Green, Tao and the first author [10] proved the following structure theorem for approximate
subgroups of arbitrary groups.

Theorem 2.3 (Structure of approximate groups [10, Theorem 1.6]). Let A be a K-approximate subgroup
of a group G. Then there is a finite subgroup $H \subset A^4$ and a subgroup $\Gamma$ containing H as a normal
subgroup such that $\Gamma/H$ is nilpotent of rank and class at most $O_K(1)$, and such that A is contained in
fewer than $O_K(1)$ left-cosets of $\Gamma$.

As usual $O_K(1)$ denotes a constant that depends on K only. Note that since the group $\Gamma$ is
finite-by-nilpotent it is also virtually nilpotent. As remarked in [10], this result easily implies Gromov’s original
theorem on groups with polynomial growth. More precisely one has the following somewhat stronger
statement.

Corollary 2.4 (One-scale Gromov’s theorem). For every $K \ge 1$ there exists $n_0 = n_0(K) \in \mathbb{N}$ such that
if G is a group generated by a finite symmetric generating set S such that $|S^{5n}| \le K|S^n|$ for some $n \ge n_0$
then G is virtually nilpotent.

Remark 2.5. Using Lemma 2.10 below, one sees that the hypothesis $|S^{5n}| \le K|S^n|$ can be weakened into
$|S^{2n+1}| \le K|S^n|$. It is an interesting problem to determine whether one can push this further and assume
only that < A'|5"| for some fixed a > 1. 

Remark 2.6. If we were to allow the uniform doubling constant in the conclusion of Theorem 1.1 to depend
on the generating set S then that theorem would become a straightforward application of Corollary 2.4,
as it is well known that large balls in Cayley graphs of finitely generated virtually nilpotent groups are
doubling (see [3], for example). However, the independence from S is key to our applications.

We recall the easily seen fact that if a group has polynomial growth, which is to say that $|S^n| = O(n^D)$
for some D > 0, then there are infinitely many n for which $|S^{5n}| \le 6^D|S^n|$, and the hypotheses of Corollary
2.4 are satisfied at infinitely many scales. By contrast, the corollary asserts that only one large scale is
necessary. For the reader’s convenience, and also for further use below, we now recall how to derive this
corollary from Theorem 2.3, as was done in [10, Corollary 11.2].

Theorem 2.3 shows that an approximate group lies in finitely many cosets of a nilpotent group. However,
Corollary 2.4 requires that the entire group lie in finitely many cosets of a nilpotent group. The key to
overcoming this apparent difficulty is to observe that if the n-ball in a Cayley graph is contained in at
most n cosets of a subgroup, then this subgroup has index at most n, as follows.

Lemma 2.7. Let G be a group generated by a finite symmetric set S containing the identity, and let $\Gamma$ be
a subgroup of G. Suppose that for some $k \in \mathbb{N}$ we have $S^k$ contained in at most k left-cosets of $\Gamma$. Then
$G = S^k\Gamma$, and in particular $[G : \Gamma] \le k$.

Proof. Since $S\Gamma \subset S^2\Gamma \subset S^3\Gamma \subset \cdots$, the number of left-cosets of $\Gamma$ having non-empty intersection with
$S^r\Gamma$ is non-decreasing in r. Thus, if $S^k$ is contained in at most k left-cosets of $\Gamma$, then $S^{r+1}\Gamma = S^r\Gamma$ for
some r < k. It follows by induction that $G = S^r\Gamma$, and hence that $G = S^k\Gamma$. □

Proof of Corollary 2.4. By Lemma 2.2, $S^{2n}$ is a K-approximate subgroup of G. By Theorem 2.3 there
exists $n_0 = n_0(K)$ such that $S^{2n}$ is contained in at most $n_0$ cosets of a virtually nilpotent subgroup $\Gamma \le G$.
If $n \ge n_0$ then Lemma 2.7 therefore implies that $\Gamma$ is of finite index in G, and hence that G is virtually
nilpotent. □

We prove Theorem 1.1 by first reducing it to the nilpotent case, which we state as follows.

Theorem 2.8 (Nilpotent case). Let $K \ge 1$ and $s \ge 1$. Suppose that S is a K-approximate subgroup in
some s-step nilpotent group G. Then there is a constant $C = C(K, s) \ge 1$ such that for every $m \ge 1$,
$S^m$ is a C-approximate subgroup.

In the remainder of this section, we explain how to perform such a reduction using Theorem 2.3. The
proof of the nilpotent case will occupy the next section and rely on a study of nilprogressions.

Remark. Contrary to the general case of Theorem 1.1, where no effective estimate is known on the size
of $\theta(K)$, an effective bound in the nilpotent case can be easily computed by following our argument.

The reduction of Theorem 1.1 to Theorem 2.8 is based on the following proposition, which makes use
of the structure theorem for approximate groups, Theorem 2.3.

Proposition 2.9. For every $K \ge 1$ there exists $n_0 = n_0(K)$ such that if G is a group generated by a
finite symmetric generating set S such that

$|S^{5n}| \le K|S^n|$ (2.2)

for some $n \ge n_0$, then there are subgroups $H \le \Gamma \le G$ such that $H \subset S^{8n}$ is normal in G, such that
$\Gamma/H$ is nilpotent of rank and class at most $O_K(1)$, and such that $\Gamma$ has index at most $n_0$ in G. Moreover,
$B := S^{8n} \cap \Gamma$ is an $O_K(1)$-approximate subgroup containing H, and there is a subset $X \subset S^n$ of size at
most $n_0$ such that for every $b \in \mathbb{N}$ we have

Rbn ^ xb'^-^. ( 2 . 3 ) 

Proof. By Lemma 2.2, $S^{2n}$ is a K-approximate subgroup of G. We may therefore apply Theorem 2.3 to
$S^{2n}$ and conclude that there is a natural number $n_1 = n_1(K)$, a subgroup $H_0 \subset S^{8n}$, and a subgroup
$\Gamma_0 \le G$ containing $H_0$ as a normal subgroup, such that $\Gamma_0/H_0$ is a nilpotent group of rank and step at
most $O_K(1)$ and such that $S^{2n}$ is contained in at most $n_1$ cosets of $\Gamma_0$. If $n \ge n_1$ then Lemma 2.7 implies
that $\Gamma_0$ has index at most $n_1$ in G. In particular, $\Gamma_0$ has a normal subgroup $\Gamma$ of index at most $n_0 := n_1!$.

Since L is normal in G, so is C®(r), the s-th term in its descending central series, where s is the 
nilpotency class of Tq/Hq. Setting H := G®(r), we see that T/H is s-step nilpotent and that H C Hq C 

gSn 

is normal in G. It is well known that if F is an approximate subgroup then so is the intersection of 
with any subgroup (see |49l Lemma 2.10], for example). This implies in particular that B := n L 
is an Oic(l)-approximate subgroup. 

It remains to exhibit a set X G satisfying (|2.3p . If n > no then S'^ intersects every coset of L in 
G, and so S"" contains a complete set of coset representatives of L; we claim that it suffices to take X to 
be this set. Indeed, by definition this set satisfies 5^” C XT, and hence 5^” C X(5®"' H L) = XB. In 
particular C S^B, and so C for each 6 E N by induction. The result follows. □ 

The following combinatorial lemma allows us to pass from a small doubling assumption to the more
comfortable $|S^{5n}| \le K|S^n|$.

Lemma 2.10. For every $K \ge 2$, there is $C(K) \ge 2$ such that if S is a finite symmetric subset (containing
1) in a group G, and if $n \ge 1$ is an integer such that

$|S^{2n+1}| \le K|S^n|$,

then $|S^{5n}| \le C(K)|S^n|$.

Proof. We apply Petridis’s lemma [4T] (see also m), according to which if A and B are finite subsets
of a group G such that $|BA| \le K|A|$ then there is a subset $A_0 \subset A$ such that $|BA_0X| \le K|A_0X|$ for every
finite subset X in G. Applying this lemma with $B = S^{n+1}$ and $A = S^n$ gives a subset $A_0$ of $S^n$ with the
property that

$|S^{n+1}A_0X| \le K|A_0X|$

for all X C G. In particular, this implies that < -AlAol, and hence that < 

K\S^\ < K'^\Aq\. This also shows that |AoSAo| < -AlAol, since AqS C Finally, taking X = Aq and 

noting that Aq G G we see that |Aq| < -AIAqI < K'^\Aq\. 

Now set Ai ;= S U Aq. Note that Af is contained in the union of the subsets Ag, AqSAo, S'^Aq, 
AqS'^, SAqS, AqS, SAq, of which all but Aq are contained in Since both Ag and are of 

size at most iF^|Ao|, this implies that |A^| < SK‘^\Ai\, and so Ai has small tripling. It is well known 
that small tripling implies small fc-pling for every integer fc; more precisely, |45l Lemma 3.4], for example, 
implies that |(Ai U A]"^)^| < OK,k{X)\Ai\. 

We claim that there exists k = Ok{^) such that S'"" is contained in (AiUA]~^)^. This will finish the proof 
of the lemma, since it implies that G (Ai U A]"^)^^, and hence that < Ox,fc(l)|^i| < Oa:(1)|'S'"'|. 

We build recursively sequences Xi G 5” and /cj G Z for i > 0. Take xq to be the identity, and /cq to be 
zero. Then, assuming xq, ...,Xi and fei, ..., fc* have been chosen, let fcj+i be the smallest integer such that 
there is y G with the property that yAi is disjoint from all XjAi for j = 0,..., i, and let Xj+i be 

such an element y. 

Let m be the largest integer such that km < n but km+i > n. Since the XjAi are all disjoint and belong 
to S'"'Ai C there can be at most |5^"'|/|Ai| < of them, and so m < K^. 

On the other hand, we claim that Xi G (Ai U A]"^)^L We verify this by induction. Assuming this holds 
for i, we write Xj+i = su, where u G and s G S'. By construction, uAi must intersect some XjAi 

non-trivially for some j < i. This means that u G XjAiAf^ G (Ai U However, S G Ai, and 

hence Xj+i G (Ai U as desired. 

If a: G S*^ is arbitrary then, by the definition of Xmi there must be some j <m such that xAi intersects 
XjAi non-trivially. This means that x G XjAiAf^, and hence that x G (Ai U We conclude 

that S” C (Ai U The claim, and hence the lemma, now follows, since m < . □ 

Lemma 2.11. Let tt : G ^ G' he a homomorphism with kernel N, and suppose that A is a symmetric 
subset of G such that N G A and such that vr(A) is a K-approximate group. Then A^ is a -approximate 
group. 

Proof. To say that 7r(A) is a LC-approximate group is to say that there exists X G G with \X\ < K such 
that A^ C XAN. Iterating this, we conclude that A^ G X^AN G X^A?. □ 

We can now complete the reduction of Theorem 1.1 to the nilpotent case.

Proof of Theorem 11.11 modulo Theorem \2.8l Applying Lemma 12.101 we may assume that < iL|S'”|. 

Proposition 12.91 therefore gives a finite-index subgroup T of G and a normal subgroup H contained in 
such that T /H is nilpotent with bounded rank and nilpotency class, such that G XB^ for every 6 > 1, 
where \X\ < uq^K) and B := n T is an Ox(l)-approximate subgroup of T containing H. Applying 
Theorem 12.81 to the nilpotent group T/H, and pulling back to T using Lemma 12.111 we conclude that for 
all 6 G N the set B^^ is an Oii:(l)-approximate group. In particular, 5^^^” C XB^^^ G XY^B'^^, for some 
subset Tf, of size at most Ok{^)- Since C we conclude that is an Oi^(l)-approximate 

subgroup, uniformly for all 6 G N. Write XiQ^n for the set XYi, witnessing this. 
Given $m \ge 16n$, let $m'$ be the smallest integer multiple of $32n$ that is at least $m$, and note that
m > m'I2, and hence that < Oi^< Ok{'^)\S’^\. We also have 

gem ^ gem' ^ 

from which the theorem then follows easily for the value of $m$ in question. For $m < 16n$, on the other
hand, the theorem follows from Lemma 2.2. □

3. NILPROGRESSIONS 

Our goal in this section is to prove Theorem 2.8. This will conclude the proof of Theorem 1.1. The
proof relies on the structure theorem for nilpotent approximate groups due to the second author [49],
which we recall below as Theorem 3.9, as well as an analysis of nilprogressions and their powers, which
forms the basis of Proposition 3.10 below.

Nilprogressions are to nilpotent groups what arithmetic progressions are to the infinite cyclic group Z. 
There are several natural ways to define this concept. Although they do not lead to exactly the same 
notions, they are roughly equivalent on some level, as we will see. We start with the following definition. 

Definition 3.1 (Nilprogression, see uni). Let $G$ be a group, let $x_1,\ldots,x_r \in G$ and let $L = (L_1,\ldots,L_r)$
be a vector of non-negative integers. If the $x_1,\ldots,x_r$ generate an $s$-step nilpotent subgroup of $G$ then
the set of all products in the $x_i$ and their inverses, in which each $x_i$ and its inverse appear at most $L_i$
times between them, is said to be a nilprogression of rank $r$ and step $s$, and is denoted $P^*(x_1,\ldots,x_r;L)$.

For the purposes of this paper we need to introduce a new variant of a nilprogression, which we call a
nilcomplete progression. This is closely related to yet another variant, introduced by Green and the first
author in [8], where it is called a nilpotent progression. We show below precisely how they may be thought
of as 'roughly equivalent'; see (3.2) in particular. For further background on this rough equivalence the
reader may consult the work [49] of the second author.

In order to define nilcomplete progressions, we need to establish some terminology. The following
definition is reproduced from [49], and follows a set-up in [8, §1] that was in turn based on [26, §11.1].

Definition 3.2 (Commutators and weights). We define (formal) commutators in the letters $x_1,\ldots,x_r$
recursively by defining each $x_i$ to be a formal commutator and, given two formal commutators $\alpha, \alpha'$ in
the $x_i$, defining $[\alpha,\alpha']$ also to be a formal commutator.

To each commutator $\alpha$ we assign a weight vector $\chi(\alpha) \in \mathbb{N}_0^r$, defined recursively by setting $\chi(x_i) := e_i$
and, given two formal commutators $\alpha, \alpha'$ in the $x_i$, defining $\chi([\alpha,\alpha']) = \chi(\alpha) + \chi(\alpha')$. We define the
total weight $|\chi(\alpha)|$ of a commutator $\alpha$ to be $\|\chi(\alpha)\|_1$. Given a weight vector $\chi \in \mathbb{N}_0^r$ and a vector
$L = (L_1,\ldots,L_r)$ of positive integers we write $L^\chi$ to denote the quantity

$$L^\chi := L_1^{\chi_1} \cdots L_r^{\chi_r}.$$

Noting that this results in at most finitely many commutators of any given weight vector, we assign a
fixed total ordering $\prec$ to the commutators, chosen arbitrarily subject to the conditions that $x_1 \prec \cdots \prec x_r$,
that commutators of the same weight vector are consecutive, and that commutators of lower total weight
come before commutators of higher total weight.

Finally, for each commutator $\alpha$ we define the (formal) inverse commutator $\alpha^{-1}$. We extend $\prec$ to a
partial ordering of commutators and their formal inverses, defining $\alpha^{\pm1} \prec \beta^{\pm1}$ when $\alpha \prec \beta$.
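The recursive definitions of the weight vector and of the quantity $L^\chi$ can be illustrated with a small computation. This is a minimal sketch: the encoding of a formal commutator as a nested tuple, and the function names `weight`, `total_weight` and `side_length`, are hypothetical choices made only for this illustration.

```python
from math import prod

# Hypothetical encoding: an int i in 1..r stands for the letter x_i,
# and a pair (a, b) stands for the formal commutator [a, b].

def weight(comm, r):
    """Weight vector chi(comm) in N_0^r, computed recursively."""
    if isinstance(comm, int):
        return tuple(1 if j == comm else 0 for j in range(1, r + 1))
    left, right = comm
    return tuple(a + b for a, b in zip(weight(left, r), weight(right, r)))

def total_weight(comm, r):
    """Total weight |chi(comm)|, the l^1 norm of the weight vector."""
    return sum(weight(comm, r))

def side_length(L, chi):
    """The quantity L^chi = L_1^{chi_1} * ... * L_r^{chi_r}."""
    return prod(l ** c for l, c in zip(L, chi))

alpha = (1, (1, 2))        # the formal commutator [x_1, [x_1, x_2]]
chi = weight(alpha, r=2)   # x_1 occurs twice, x_2 once
```

For instance, with $L = (3, 5)$ the commutator $[x_1,[x_1,x_2]]$ has weight vector $(2,1)$, total weight $3$, and $L^\chi = 3^2 \cdot 5 = 45$.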

Certain commutators in the $x_1,\ldots,x_r$ are called basic commutators. These are defined precisely in
each of [8, 26, 49]; for the purposes of this paper it will suffice to note that if $G$ is the free $s$-step nilpotent
group on generators $x_1,\ldots,x_r$, and $c_1,\ldots,c_{t'}$ is the ordered (with respect to $\prec$) list of basic commutators
of weight at most $s$ in the $x_i$, then the elements

$$c_1^{l_1} \cdots c_{t'}^{l_{t'}}$$

are all distinct as the $l_i$ range over $\mathbb{Z}$ and, moreover, every element of $G$ can be expressed in this form.

Definition 3.3 (Nilpotent progression, see [8]). Let $G$ be a group, let $x_1,\ldots,x_r$ be elements of $G$ and let
$L = (L_1,\ldots,L_r)$ be a vector of non-negative integers. If $x_1,\ldots,x_r$ generate an $s$-step nilpotent subgroup
of $G$ then, writing $c_1,\ldots,c_{t'}$ for the ordered (with respect to $\prec$) list of basic commutators of weight at
most $s$ in the $x_i$, the set

$$P(x_1,\ldots,x_r;L) := \{c_1^{l_1} \cdots c_{t'}^{l_{t'}} : |l_i| \le L^{\chi(c_i)}\}$$

is said to be a nilpotent progression of rank $r$ and step $s$. If the elements $c_1^{l_1} \cdots c_{t'}^{l_{t'}}$ are all distinct as the
$l_i$ range over the integers with $|l_i| \le L^{\chi(c_i)}$ then the nilpotent progression $P(x_1,\ldots,x_r;L)$ is said to be proper.

Remark 3.4. The nilpotent progression $P(x_1,\ldots,x_r;L)$ is proper, for example, in the free $s$-step nilpotent
group generated by $x_1,\ldots,x_r$.

Remark 3.5. The cardinality of a nilpotent progression $P(x_1,\ldots,x_r;L)$ is easily seen to be at most
$\prod_{i=1}^{t'}(2L^{\chi(c_i)} + 1)$, with equality if and only if $P(x_1,\ldots,x_r;L)$ is proper. In the case that $P(x_1,\ldots,x_r;L)$
is proper, this implies in particular that $|P(x_1,\ldots,x_r;ML)| \le O_{r,s,M}(1)\,|P(x_1,\ldots,x_r;L)|$ for every $M \in \mathbb{N}$.
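The cardinality count for a proper nilpotent progression can be checked by brute force in the smallest non-abelian case. The sketch below works in the integer Heisenberg group, which is the free 2-step nilpotent group on two generators, with elements stored as triples $(a,b,c)$. It assumes, purely for illustration, that the basic commutators of weight at most 2 are $x_1$, $x_2$ and $[x_2,x_1]$ with $[v,u] = v^{-1}u^{-1}vu$; all function names are hypothetical.

```python
from itertools import product

def mul(g, h):
    """Multiply elements (a, b, c) of the integer Heisenberg group."""
    return (g[0] + h[0], g[1] + h[1], g[2] + h[2] + g[0] * h[1])

def inv(g):
    """Inverse of (a, b, c)."""
    return (-g[0], -g[1], -g[2] + g[0] * g[1])

def power(g, n):
    """g^n for any integer n, by repeated multiplication."""
    base = g if n >= 0 else inv(g)
    out = (0, 0, 0)
    for _ in range(abs(n)):
        out = mul(out, base)
    return out

def comm(v, u):
    """[v, u] = v^{-1} u^{-1} v u."""
    return mul(mul(inv(v), inv(u)), mul(v, u))

x1, x2 = (1, 0, 0), (0, 1, 0)
c = comm(x2, x1)  # assumed basic commutator of weight vector (1, 1)

def nilpotent_progression(L1, L2):
    """All x1^l1 x2^l2 c^l3 with |l1| <= L1, |l2| <= L2, |l3| <= L1 * L2."""
    ranges = (range(-L1, L1 + 1), range(-L2, L2 + 1),
              range(-L1 * L2, L1 * L2 + 1))
    return {mul(mul(power(x1, l1), power(x2, l2)), power(c, l3))
            for l1, l2, l3 in product(*ranges)}

P = nilpotent_progression(2, 3)
bound = (2 * 2 + 1) * (2 * 3 + 1) * (2 * 2 * 3 + 1)  # product of (2 L^chi + 1)
```

Here the progression attains the upper bound exactly, as expected for a proper progression.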

We now move on to the definition of a nilcomplete progression. The reason for introducing nilcomplete
progressions is that they behave particularly well on taking powers or on scaling the side lengths $L_i$
(see Proposition 3.10 below). The definition is similar to that of a nilpotent progression, except that we
use the list of all commutators in the $x_i$ instead of just the basic commutators, and that when taking
commutators we allow lower-weight components to be replaced by their inverses (so, for example, we allow
commutators such as $[x_1,x_2^{-1}]$ and $[x_3,[x_1^{-1},x_2]^{-1}]$). We express this more formally as follows.

Definition 3.6 (Generalised commutators). We define the generalised commutators in the $x_i$ in exactly
the same way as the commutators, except that if $\alpha_1$ and $\alpha_2$ are two generalised commutators in the $x_i$
then so is $[\alpha_1^{\varepsilon_1},\alpha_2^{\varepsilon_2}]$ for every $\varepsilon_1 = \pm1$ and $\varepsilon_2 = \pm1$.

We extend the definition of weight vectors from commutators to generalised commutators by requiring
that the weight vectors of $[\alpha_1^{\varepsilon_1},\alpha_2^{\varepsilon_2}]$ are equal for all choices of $\varepsilon_1$ and $\varepsilon_2$. We extend the order $\prec$ arbitrarily
so that the generalised commutators $[\alpha_1^{\varepsilon_1},\alpha_2^{\varepsilon_2}]$ with different choices of $\varepsilon_1$ and $\varepsilon_2$ are consecutive.

Throughout this section we write $\xi_1,\ldots,\xi_t$ for the ordered (with respect to $\prec$) list of generalised
commutators of total weight at most $s$ in $x_1,\ldots,x_r$. Note that $t$ is bounded in terms of $r$ and $s$ only.

Definition 3.7 (Nilcomplete progression). Let $G$ be a group, let $x_1,\ldots,x_r$ be elements of $G$, and let
$L = (L_1,\ldots,L_r)$ be a vector of non-negative integers. If $x_1,\ldots,x_r$ generate an $s$-step nilpotent subgroup
of $G$ then, writing $\xi_1,\ldots,\xi_t$ for the ordered (with respect to $\prec$) list of generalised commutators of weight
at most $s$ in the $x_i$, the set

$$\overline{P}(x_1,\ldots,x_r;L) := \{\xi_1^{l_1} \cdots \xi_t^{l_t} : |l_i| \le L^{\chi(\xi_i)}\}$$

is said to be a nilcomplete progression of rank $r$ and step $s$.

We recall a technical notion that was useful in [49].

Definition 3.8 (Ordered progression). Let $x_1,\ldots,x_r$ be elements of a group and let $L = (L_1,\ldots,L_r)$ be
a vector of non-negative integers. Then the set

$$P_{\mathrm{ord}}(x_1,\ldots,x_r;L) := \{x_1^{l_1} \cdots x_r^{l_r} : |l_i| \le L_i\}$$

is said to be an ordered progression of rank $r$.

It is immediate from the definitions that

$$\overline{P}(x_1,\ldots,x_r;L) = P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;L^{\chi(\xi_1)},\ldots,L^{\chi(\xi_t)}). \tag{3.1}$$

It was shown in [49] that nilprogressions, nilpotent progressions and nilcomplete progressions are 'essentially'
equivalent, and that nilcomplete progressions control nilpotent approximate groups. In particular,
we have

$$P_{\mathrm{ord}}(x_1,\ldots,x_r;L) \subset P^*(x_1,\ldots,x_r;L) \subset P(x_1,\ldots,x_r;L) \subset \overline{P}(x_1,\ldots,x_r;L) \subset P_{\mathrm{ord}}(x_1,\ldots,x_r;L)^{O_{r,s}(1)}. \tag{3.2}$$

Here the first and third inclusions are trivial, while the second follows from the commutator collecting
process, as described in [39]. The last inclusion is proven in [49, Proposition C.1] for $P$ in place of $\overline{P}$, but
precisely the same argument works and yields (3.2).

Next we recall the main result of [49].

Theorem 3.9 (Structure of nilpotent approximate groups, [49]). Let $G$ be an $s$-step nilpotent group, and
suppose that $A$ is a $K$-approximate subgroup of $G$. Then there is a subgroup $H$ normalised by $A$, a natural
number $r \le O_{K,s}(1)$, elements $x_1,\ldots,x_r \in G$, and a vector $L = (L_1,\ldots,L_r)$ of non-negative integers such
that

$$A \subset HP^*(x_1,\ldots,x_r;L) \subset HP(x_1,\ldots,x_r;L) \subset H\overline{P}(x_1,\ldots,x_r;L) \subset A^{O_{K,s}(1)}.$$

Again, this statement was proven in [49] only for $P$, and not for $\overline{P}$, but precisely the same proof yields
the result for $\overline{P}$ as well.

We are now ready to state the main technical result of this section, which is a description of the 
properties of nilcomplete progressions and their powers. 

Proposition 3.10 (Powers of nilcomplete progressions). Let $G$ be an $s$-step nilpotent group, let $x_1,\ldots,x_r$
be elements of $G$, and let $L = (L_1,\ldots,L_r)$ be a vector of positive integers. Then the following conditions
hold.

(1) $\overline{P}(x_1,\ldots,x_r;nL) \subset \overline{P}(x_1,\ldots,x_r;L)^{O_{r,s}(n)}$.

(2) $\overline{P}(x_1,\ldots,x_r;L)^n \subset \overline{P}(x_1,\ldots,x_r;nL)$.

(3) For every $M \in \mathbb{N}$ there is some set $X \subset G$ of cardinality at most $O_{r,s,M}(1)$ such that

$$\overline{P}(x_1,\ldots,x_r;ML) \subset \overline{P}(x_1,\ldots,x_r;L)X.$$

Remark 3.11. An inspection of the arguments of |49j reveals that the exponent in (|3.2I1 could be 

written more precisely as or simply Os{r^^^^). An inspection of the arguments below then 

shows that the constant implied by Or,s(n) in Proposition 13.101 (fT|) could be written as 
or more succinctly as Os(r^^^^n). Similarly, the Or,s,M(i) in Proposition 13.101 (i3ll could be written as 

rO(PrOM) ^srO(‘ 

or more succinctly as We leave the details to the interested reader. 
It is convenient at this point to introduce some further notation. Given a list $y_1,\ldots,y_k$ of commutators
in the $x_i$ we denote by $L^{\chi(y)}$ the $k$-dimensional vector $(L^{\chi(y_1)},\ldots,L^{\chi(y_k)})$.
Thus, for example, (3.1) can be rewritten more succinctly as

$$\overline{P}(x_1,\ldots,x_r;L) = P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;L^{\chi(\xi)}). \tag{3.3}$$

In the event that $L = (L_1,\ldots,L_r)$ is a vector of real numbers, as opposed to integers, we define $\lfloor L \rfloor$ and
$\lceil L \rceil$ coordinate-wise; thus, for example, $\lfloor L \rfloor = (\lfloor L_1 \rfloor,\ldots,\lfloor L_r \rfloor)$. When $L'$ is another vector of integers or
real numbers, we write $L \le L'$ or $L < L'$ to indicate that the appropriate inequality holds coordinate-wise.
Finally we introduce one further definition.
Finally we introduce one further definition. 

Definition 3.12 (Nilcomplete set of generalised commutators). A set $Y$ of generalised commutators in a
list of group elements $x_1,\ldots,x_r$ is said to be nilcomplete if whenever $\alpha_1,\alpha_2 \in Y$ and $\varepsilon_1 = \pm1$ and $\varepsilon_2 = \pm1$
are such that $[\alpha_1^{\varepsilon_1},\alpha_2^{\varepsilon_2}] \ne 1$, the generalised commutator $[\alpha_1^{\varepsilon_1},\alpha_2^{\varepsilon_2}]$ also belongs to $Y$.

We now pass to the proof of Proposition 3.10.

Proof of PropositionlSJE m) ■ We set B{L) := ^ use the shorthand P{L) := 

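$\overline{P}(x_1,\ldots,x_r;L)$. By definition of $\overline{P}$ (see (3.3)) we have $\overline{P}(nL) \subset P(\xi_1;(nL)^{\chi(\xi_1)}) \cdots P(\xi_t;(nL)^{\chi(\xi_t)})$.
However, [49, Lemma C.2] implies that

$$P(\xi_i;(nL)^{\chi(\xi_i)}) \subset B(nL)^{O_{r,s}(1)}.$$

Noting again that $t$ is bounded in terms of $r$ and $s$ only, we see that

$$\overline{P}(nL) \subset B(nL)^{O_{r,s}(1)} \subset B(L)^{O_{r,s}(n)}, \tag{3.4}$$

and the result then follows from the fact that $B(L) \subset \overline{P}(L)$. □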

Proof of Proposition 3.10 (2). We in fact show that

$$P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;L^{\chi(\xi)})^n \subset P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;(nL)^{\chi(\xi)});$$

in light of (3.3) this is sufficient to prove Proposition 3.10 (2).

First, note the trivial identity

$$vu = uv[v,u]. \tag{3.5}$$

Since the $\xi_i$ are generalised commutators, it is very straightforward to use this identity repeatedly to see
that any string $\sigma$ in the $x_i$ is expressible in the form

$$\xi_1^{l_1} \cdots \xi_t^{l_t} \tag{3.6}$$

for some $l_1,\ldots,l_t \in \mathbb{Z}$. For example, the string $x_2x_1^{-1}$ is equal to $x_1^{-1}x_2[x_2,x_1^{-1}]$. This is essentially a
simplified version of the collecting process of [26, §11.1]. The challenge is to show that if $\sigma$ is of the
form $p^{(1)} \cdots p^{(n)}$, with each $p^{(j)}$ belonging to $P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;L^{\chi(\xi)})$, then we may conclude additionally that

$$|l_i| \le (nL)^{\chi(\xi_i)}.$$
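The identity $vu = uv[v,u]$ holds in every group once the commutator is read as $[v,u] = v^{-1}u^{-1}vu$, which is the convention the identity forces. As a sanity check, the following sketch verifies it exhaustively in the symmetric group on four points; the choice of group and the helper names are assumptions made only for this illustration.

```python
from itertools import permutations

def mul(p, q):
    """Product pq of permutations given as tuples (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(p)))

def inv(p):
    """Inverse permutation."""
    out = [0] * len(p)
    for i, image in enumerate(p):
        out[image] = i
    return tuple(out)

def comm(v, u):
    """[v, u] = v^{-1} u^{-1} v u."""
    return mul(mul(inv(v), inv(u)), mul(v, u))

S4 = list(permutations(range(4)))

# Check vu = uv[v, u] for every ordered pair (v, u) in S_4.
identity_holds = all(mul(v, u) == mul(mul(u, v), comm(v, u))
                     for v in S4 for u in S4)
```

Taking $v = x_2$ and $u = x_1^{-1}$ in the identity gives exactly the example $x_2x_1^{-1} = x_1^{-1}x_2[x_2,x_1^{-1}]$.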

Let $P^{(1)},\ldots,P^{(n)}$ be distinct copies of $P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;L^{\chi(\xi)})$, so that $P_{\mathrm{ord}}(\xi_1,\ldots,\xi_t;L^{\chi(\xi)})^n$ may be
written as $P^{(1)} \cdots P^{(n)}$. We bound from above the number of instances of each generalised commutator $\xi_i$
that can occur when for each $j$ we have an element $p^{(j)}$ of $P^{(j)}$, viewed as a string in the $\xi_i$ in the natural
way, and the simplified collecting process is applied to the string $p^{(1)} \cdots p^{(n)}$.

In order to do this, we give a different label to each occurrence of each generalised commutator in
the initial string $p^{(1)} \cdots p^{(n)}$. Specifically, given the string $p^{(j)} = \xi_1^{l_1^{(j)}} \cdots \xi_t^{l_t^{(j)}}$, for each $i = 1,\ldots,t$
and for $z = 1,\ldots,|l_i^{(j)}|$ let $\xi_{i,z}^{(j)}$ be distinct copies of the element $\xi_i^{\pm1}$ (with the sign of $l_i^{(j)}$). Thus we have

$$p^{(j)} = \xi_{1,1}^{(j)} \cdots \xi_{1,|l_1^{(j)}|}^{(j)} \cdots \xi_{t,1}^{(j)} \cdots \xi_{t,|l_t^{(j)}|}^{(j)}$$

and, in particular,

$$p^{(1)} \cdots p^{(n)} = \xi_{1,1}^{(1)} \cdots \xi_{t,|l_t^{(1)}|}^{(1)} \cdots \xi_{1,1}^{(n)} \cdots \xi_{t,|l_t^{(n)}|}^{(n)}. \tag{3.7}$$



Let us apply (3.5) repeatedly to this string so that when we once again identify each occurrence of the
same generalised commutator it is of the form (3.6). Specifically, beginning with the left-most occurrence
of $\xi_1$, use identity (3.5) to move each instance of $\xi_1$ to the left until it is to the left of every instance
of every commutator that is later with respect to the order $\prec$; then perform the same process with the
instances of $\xi_2$; and so on up to the instances of $\xi_t$ (which by this time, of course, will be at the extreme
right of the expression anyway).
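The collecting procedure just described can be simulated in a toy case. The sketch below is not the general process: it assumes a 2-step nilpotent group on two generators, uses positive exponents only, and treats a single central letter `c` standing for $[x_2,x_1]$, so that moving `c` past other letters creates nothing new. The function name `collect` and the string encoding are hypothetical.

```python
ORDER = {"x1": 0, "x2": 1, "c": 2}  # x1 before x2 before c = [x2, x1]

def collect(word):
    """Sort a word into the order x1..., x2..., c... and count new commutators.

    Each swap of an adjacent pair 'x2 x1' into 'x1 x2' inserts one new 'c'
    immediately to its right, as in vu = uv[v, u]; 'c' is central (2-step
    assumption), so swapping it with another letter creates nothing.
    """
    word = list(word)
    composite = 0
    i = 0
    while i < len(word) - 1:
        v, u = word[i], word[i + 1]
        if ORDER[v] > ORDER[u]:
            word[i], word[i + 1] = u, v
            if (v, u) == ("x2", "x1"):
                word.insert(i + 2, "c")
                composite += 1
            i = max(i - 1, 0)  # the swapped letter may need to move further left
        else:
            i += 1
    return word, composite

n, L1, L2 = 3, 2, 3
block = ["x1"] * L1 + ["x2"] * L2 + ["c"] * (L1 * L2)  # one maximal element
collected, composite = collect(block * n)
```

In this example the newly created commutators number $18$, within the bound $(nL)^{\chi} - nL^{\chi} = 54 - 18 = 36$ for the weight vector $\chi = (1,1)$, and the total number of instances of `c` is $36 \le (nL)^\chi = 54$.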

Once this is done, the resultant string of the form (3.6) will consist of commutators whose arguments are
elements of the form $\xi_{i,z}^{(j)}$. Some of these will simply be the original elements $\xi_{i,z}^{(j)}$ from the string (3.7); let
us call such elements prime instances of the commutator $\xi_i$. However, suppose the commutator $\xi_i$ is equal
to $[\xi_v,\xi_u]$, with $u < v$. Then there may also be instances of $\xi_i$ in (3.6) that resulted from interchanging
instances of $\xi_v$ with instances of $\xi_u$; let us call such instances of $\xi_i$ composite instances. Note that a
composite instance of the commutator $[\xi_v,\xi_u]$ could have arisen from interchanging a composite instance
of $\xi_v$ with a composite instance of $\xi_u$.

Write $p_i^{(j)}$ for the number of prime instances of $\xi_i$ originating from the progression $P^{(j)}$, and $C_i$ for the
total number of composite instances of $\xi_i$. Our aim is to prove that $p_i^{(1)} + \cdots + p_i^{(n)} + C_i \le (nL)^{\chi(\xi_i)}$. Since

$$p_i^{(j)} \le L^{\chi(\xi_i)} \tag{3.8}$$

for each $j$ by definition, this amounts to showing that

$$C_i \le (nL)^{\chi(\xi_i)} - nL^{\chi(\xi_i)}. \tag{3.9}$$



This is trivial for $i = 1,\ldots,r$, since for each value of $i$ at most $r$ we have $C_i = 0$, and so by induction
we may prove (3.9) for an arbitrary fixed value $w$ of $i$, under the assumptions that $w > r$ and that (3.9)
holds for all smaller values of $i$. Since $w > r$ we have $\xi_w = [\xi_v,\xi_u]$ for some $u$ and $v$ satisfying $u < v < w$.

Let us first bound from above the number of composite instances of $\xi_w$ in which the $\xi_u$ component is
prime. Indeed, consider first the number of composite instances of $\xi_w$ in which the $\xi_u$ component is a
prime instance originating from $P^{(j)}$. Prime instances of commutators originating in the same progression
$P^{(j)}$ are in the desired order from the beginning, and so will never swap under the simplified collecting
process, and so if $\xi$ is a composite instance of $\xi_w$ in which the $\xi_u$ component is a prime instance originating
from $P^{(j)}$, then its $\xi_v$ component cannot be a prime instance originating from $P^{(j)}$.

The number of instances of $\xi_v$ that are not prime instances coming from $P^{(j)}$ is at most $(nL)^{\chi(\xi_v)} - L^{\chi(\xi_v)}$
by (3.8) and the induction hypothesis. On the other hand, (3.8) implies that the number of prime instances
of $\xi_u$ originating from $P^{(j)}$ is at most $L^{\chi(\xi_u)}$. The total number of composite instances of $\xi_w$ in which the
$\xi_u$ component is a prime instance originating from $P^{(j)}$ is therefore at most

$$L^{\chi(\xi_u)}\bigl((nL)^{\chi(\xi_v)} - L^{\chi(\xi_v)}\bigr),$$

and so the total number of composite instances of $\xi_w$ in which the $\xi_u$ component is a prime instance
originating from any progression is at most

$$(n^{|\chi(\xi_v)|+1} - n)L^{\chi(\xi_w)}. \tag{3.10}$$

Let us now bound the number of composite instances of $\xi_w$ in which the $\xi_u$ component is composite. By
induction, the number of composite instances of $\xi_u$ is at most $(nL)^{\chi(\xi_u)} - nL^{\chi(\xi_u)}$, whilst by induction and
(3.8) the total number of instances of $\xi_v$ is at most $(nL)^{\chi(\xi_v)}$. The number of composite instances of $\xi_w$
in which the $\xi_u$ component is composite is therefore at most

$$(n^{|\chi(\xi_w)|} - n^{|\chi(\xi_v)|+1})L^{\chi(\xi_w)}. \tag{3.11}$$

The total number of composite instances of $\xi_w$ is therefore at most the sum of the quantities (3.10) and
(3.11), which is $(nL)^{\chi(\xi_w)} - nL^{\chi(\xi_w)}$. This verifies (3.9) in the case $i = w$, and so ends the proof. □
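For the reader's convenience, here is the arithmetic behind the final step. It is a sketch under the following assumptions about notation: $\xi_w$ is the commutator under consideration, built from $\xi_u$ and $\xi_v$ so that $\chi(\xi_w) = \chi(\xi_u) + \chi(\xi_v)$; the two bounds being added are $(n^{|\chi(\xi_v)|+1} - n)L^{\chi(\xi_w)}$ and $(n^{|\chi(\xi_w)|} - n^{|\chi(\xi_v)|+1})L^{\chi(\xi_w)}$; and $(nL)^{\chi} = n^{|\chi|}L^{\chi}$ for every weight vector $\chi$.

```latex
\begin{align*}
(n^{|\chi(\xi_v)|+1} - n)L^{\chi(\xi_w)} + (n^{|\chi(\xi_w)|} - n^{|\chi(\xi_v)|+1})L^{\chi(\xi_w)}
  &= (n^{|\chi(\xi_w)|} - n)L^{\chi(\xi_w)} \\
  &= (nL)^{\chi(\xi_w)} - nL^{\chi(\xi_w)}.
\end{align*}
```

The middle terms cancel, which is why the prime and composite cases are split according to the same component.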

In light of (3.3), Proposition 3.10 (3) follows from the following more general statement.

Proposition 3.13. Let $y_1,\ldots,y_k$ be a nilcomplete set of generalised commutators in the $x_i$, appearing
in order with respect to $\prec$. Then for every $M \in \mathbb{N}$ there is some set $X \subset G$ of size bounded in terms of
$k, s, M$ such that

$$P_{\mathrm{ord}}(y_1,\ldots,y_k;(ML)^{\chi(y)}) \subset P_{\mathrm{ord}}(y_1,\ldots,y_k;L^{\chi(y)})X.$$

The first containment of (3.2) expresses the trivial fact that an ordered progression is contained
inside the corresponding nilprogression. The following useful lemma shows that the reverse containment
is also approximately true if the generators form a nilcomplete set.

Lemma 3.14. Let $y_1,\ldots,y_k$ be a nilcomplete set of generalised commutators in the $x_i$, appearing in order
with respect to $\prec$. Then

$$P^*(y_1,\ldots,y_k;L^{\chi(y)}) \subset P_{\mathrm{ord}}(y_1,\ldots,y_k;(O_{k,s}(1)L)^{\chi(y)}).$$

Proof. Since the $y_i$ are in order with respect to $\prec$, for each $j$ the sequence $y_j,\ldots,y_k$ is still nilcomplete,
and so by induction on $k$ it suffices to prove that

$$P^*(y_1,\ldots,y_k;L^{\chi(y)}) \subset P(y_1;L^{\chi(y_1)})\,P^*(y_2,\ldots,y_k;(O_{k,s}(1)L)^{\chi(y_2)},\ldots,(O_{k,s}(1)L)^{\chi(y_k)}).$$

By (3.2) it is in fact sufficient to prove that

$$P(y_1,\ldots,y_k;L^{\chi(y)}) \subset P(y_1;L^{\chi(y_1)})\,P^*(y_2,\ldots,y_k;(O_{k,s}(1)L)^{\chi(y_2)},\ldots,(O_{k,s}(1)L)^{\chi(y_k)}).$$

However, the fact that the set of $y_i$ is nilcomplete implies that each basic commutator in the $y_i$ is itself equal
to some $y_j$, and so this inclusion follows from the fact that there are at most $O_{k,s}(1)$ basic commutators
in the $y_i$ of total weight at most $s$. □

Lemma 3.15. Let $z,y_1,\ldots,y_k$ be a nilcomplete set of generalised commutators in the $x_i$, appearing in
order with respect to $\prec$. Let $m \in \mathbb{Z}$, and suppose that

$$|m| \le L^{\chi(z)}. \tag{3.12}$$

Then

$$z^mP^*(y_1,\ldots,y_k;L^{\chi(y)}) \subset P^*(y_1,\ldots,y_k;(O_{k,s}(1)L)^{\chi(y)})z^m.$$
Proof. Let $p$ be an arbitrary element of $P^*(y_1,\ldots,y_k;L^{\chi(y)})$, viewed in the natural way as a string in the
$y_i$ and their inverses. Repeated application of the identity $uv = [u^{-1},v^{-1}]vu$ implies that there is some
formal string $p'$ obtained by inserting into $p$, at various places, commutators of the form

$$[z^{-m},y_j^{-1}] \tag{3.13}$$

or

$$[z^{-m},y_j] \tag{3.14}$$

such that

$$z^mp = p'z^m$$


as group elements. By [491 Proposition B.3], for each j there are natural numbers tj and 
with 

tj < Os(l), (3.15) 

and generalised commutators ..., Qj^t. and Cj i,..., C,j^t. in z and yj, such that 


and 






Moreover, writing ojjj for the z-weight of the generalised commutator considered as a commutator in 
z and yj, the bound (I3.12j) implies that the integers rijj arising from [49l Proposition B.3] satisfy 


\nj,i\ < (3.16) 

We may therefore replace each instance of (I3.13P in p' with a string of the form ''' ^j i ^ each 

instance of (|3.14l) with a string of the form ''' Cj f • Write p" for the string produced by making 
each of these replacements, so that p' and p" are equal as group elements. 

Now, each instance of (13.131) or (|3.14l) appearing in p' arises from an element of the form yj or y~^ 
in the string p. There are at most such elements in the string p by definition, and so for each j 

there can be at most instances of (|3.13l) and (|3.14p between them. In particular, there are at most 

instances of the strings and Qjj in the string p". The inequality ()3.16p therefore implies 
that the total number of instances of the strings Qj and Cj,i in the string p" is at most Here, 

x(Cj,i) refers to the weight of (jj as a generalised commutator in the x*. 

Since the list z,yi,... ,yk is nilcomplete, each (jj is equal to some y^^, and each ^jj is equal to some 
yp . Let p be the string of yi formed from p" by replacing each instance of Qj in p" by ., and each 

instance of Qj by y? ,. It follows from (|3.15p that the total number of ("j,* and Qjj is at most Oa:,s( 1)) and 
so the preceding paragraph and the definition of p imply that the string p contains at most (Ofc^s(l)L)^^^*) 
instances of yi and its inverse. In particular, p G P*(yi,..., , and so the lemma follows 

from the fact that p and p' are equal as group elements. □ 


We can now finally prove Proposition 3.13.


Proof of Proposition 3.13. We prove that there is a set $X$ of size bounded in terms of $s$ and $M$ such that

$$P_{\mathrm{ord}}(y_1,\ldots,y_k;(ML)^{\chi(y)}) \subset P_{\mathrm{ord}}(y_1,\ldots,y_k;L^{\chi(y_1)},(O_{k,s}(M)L)^{\chi(y_2)},\ldots,(O_{k,s}(M)L)^{\chi(y_k)})X;$$

by induction on $k$ this suffices to prove the proposition. More precisely, we prove that this holds with
X = which has cardinality at most 2M® + 1. To prove that it holds with this choice 

of X, note simply that 

Pord{yi,...,yk;{ML)^^y^) 

by Lemma 3.15, and that

$$P^*(y_2,\ldots,y_k;(O_{k,s}(M)L)^{\chi(y_2)},\ldots,(O_{k,s}(M)L)^{\chi(y_k)}) \subset P_{\mathrm{ord}}(y_2,\ldots,y_k;(O_{k,s}(M)L)^{\chi(y_2)},\ldots,(O_{k,s}(M)L)^{\chi(y_k)})$$

by Lemma 3.14. This ends the proof of Proposition 3.13, and hence of Proposition 3.10 (3). □

Having proved Proposition 3.10, we can now easily deduce Theorem 2.8.

Proof of Theorem \2.8l First we claim that it is enough to find positive integers Nq and C depending 
only on K and s such that is a C-approximate subgroup for all n > 1. Indeed, since 5 is a K- 

approximate subgroup, there is some Y G G, the size of which is bounded in terms of K and s, such 
that 3"^^° C SY. Given any integer m > there exists n > 1 such that uNq < m < [n + l)iVo) and 
so C 52 (n+i)Afo-i _ g 2 nNo-ig 2 No g2nNoY S'^^°XY for some subset X of size at most C. For 
"kn < A^O) meanwhile, we have S'^™' C 3“^^° C 3‘^Y. This proves the claim. 

Now, without loss of generality, we may assume that 3 generates G. By Theorem 13.91 there is a finite 
subgroup H < G normalised by 3, and hence normal in G; integers r,M < and a nilcomplete 

progression P{L) := P{xi, ..., Xr] L) for some elements xi, ...,Xr in G such that 

3 C HP{L) C 

Increasing M if necessary, Proposition l3.ini (fT]l implies that P{nL) C P{L)^^ for all n > 1. Let Nq := M^. 
Since H is normal in G, we have {HP{L)A = HP{LA for every n > 1. Applying Proposition 13.101 (l2|) 
and we therefore see that for each n > 1 there is a subset Xn of size bounded in terms of s and K 
such that 

g 2 Non ^ ^ HP{Lf^°^ C HP{2NonL) 

C P{nL)Xn C H(P{L))^^Xn C 3^"^Xn = S^^^Xn, 
and so the theorem is proved. □ 

This also completes the proof of Theorem 1.1.

3.1. Doubling of the various types of progression. We have made heavy use of the principle that
sets of small doubling are controlled by nilprogressions. In this subsection we take the opportunity to
explain how Proposition 3.10 can be used to shed light on the extent to which the converse holds.

First, we note that nilcomplete progressions always enjoy small doubling. Indeed, an immediate consequence
of Proposition 3.10 is that for every nilcomplete progression $\overline{P}$ of rank $r$ and step $s$ there is a
set $X$ of cardinality $O_{r,s}(1)$ such that $\overline{P}^2 \subset \overline{P}X$.

For nilprogressions the situation is not quite so straightforward; one may check, for example, that in
the free $s$-step nilpotent group generated by $x_1$ and $x_2$ the ratio $|P^*(x_1,x_2;L,1)^2|/|P^*(x_1,x_2;L,1)|$ tends
to infinity as $L \to \infty$. Nonetheless, it is shown in Tao's book [46, Proposition 2.2.3] that, provided the
lengths $L_1,\ldots,L_r$ are large enough in terms of the rank $r$ and step $s$ only, a nilprogression is an $O_{r,s}(1)$-approximate
group. We recover this fact, and obtain a similar statement for nilpotent progressions, as
follows.


Corollary 3.16. Fix $r,s \ge 1$. Then there is a constant $\lambda = \lambda_{r,s}$ such that whenever $x_1,\ldots,x_r$ are
elements in an $s$-step nilpotent group and $L = (L_1,\ldots,L_r)$ is a vector of integers that are at least $\lambda$, there
exists a set $X$ of cardinality at most $O_{r,s}(1)$ such that the nilprogression $P^* = P^*(x_1,\ldots,x_r;L)$ and the
nilpotent progression $P = P(x_1,\ldots,x_r;L)$ satisfy

$$(P^*)^2 \subset P^2 \subset P^*X \subset PX.$$

In particular, since it is symmetric the nilprogression $P^*$ is an $O_{r,s}(1)$-approximate group.

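Proof. Note that (3.4) implies the existence of a constant $C = C_{r,s}$ such that

$$\overline{P}(x_1,\ldots,x_r;L') \subset P^*(x_1,\ldots,x_r;CL') \tag{3.17}$$

for every $L' = (L'_1,\ldots,L'_r)$. We claim that taking $\lambda = 2C$ satisfies the requirements of the corollary; thus
we may assume that $L_1,\ldots,L_r \ge 2C$. This implies in particular that

$$\lceil L_i/2C \rceil \le L_i/C \tag{3.18}$$

for each $i$. Denote by $\lceil L/2C \rceil$ the vector $(\lceil L_1/2C \rceil,\ldots,\lceil L_r/2C \rceil)$, and let $X$ be the set obtained by
applying Proposition 3.10 (3) to $\overline{P}(x_1,\ldots,x_r;\lceil L/2C \rceil)$ in the case $M = 4C$, so that

$$\overline{P}(x_1,\ldots,x_r;4C\lceil L/2C \rceil) \subset \overline{P}(x_1,\ldots,x_r;\lceil L/2C \rceil)X. \tag{3.19}$$

Now note that

$$\begin{aligned}
P^*(x_1,\ldots,x_r;L)^2 &\subset P(x_1,\ldots,x_r;L)^2 && \text{by (3.2)} \\
&\subset \overline{P}(x_1,\ldots,x_r;L)^2 && \text{by (3.2)} \\
&\subset \overline{P}(x_1,\ldots,x_r;2L) && \text{by Proposition 3.10 (2)} \\
&\subset \overline{P}(x_1,\ldots,x_r;4C\lceil L/2C \rceil) \\
&\subset \overline{P}(x_1,\ldots,x_r;\lceil L/2C \rceil)X && \text{by (3.19)} \\
&\subset P^*(x_1,\ldots,x_r;L)X && \text{by (3.17) and (3.18)} \\
&\subset P(x_1,\ldots,x_r;L)X && \text{by (3.2).}
\end{aligned}$$

□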
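The two elementary inequalities on side lengths used in this proof, namely $\lceil L_i/2C \rceil \le L_i/C$ and $2L_i \le 4C\lceil L_i/2C \rceil$ whenever $L_i \ge 2C$, can be checked numerically. The following is a small sketch over a finite range of values; the helper names and the tested ranges are arbitrary choices for illustration, and the check is of course no substitute for the one-line proofs.

```python
def ceil_div(a, b):
    """Ceiling of a / b for positive integers, using integer arithmetic only."""
    return -(-a // b)

def check(C, L):
    """Check the elementary inequalities for one side length L and constant C."""
    q = ceil_div(L, 2 * C)           # the entry ceil(L / 2C)
    return (C * q <= L               # equivalent to ceil(L / 2C) <= L / C
            and 2 * L <= 4 * C * q)  # so side length 2L is at most 4C * ceil(L / 2C)

# Every side length L >= 2C in the tested range satisfies both inequalities.
all_ok = all(check(C, L) for C in range(1, 8)
             for L in range(2 * C, 2 * C + 200))

# The lower bound on L matters: for L < C the first inequality fails.
small_fails = not check(3, 2)
```

The second assignment shows that some lower bound on the side lengths is genuinely needed for the first inequality.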


One may show by a similar argument, making use of Remark 3.5, that a proper $s$-step nilpotent
progression $P$ of rank $r$ satisfies $|P^k| \le O_{k,r,s}(1)|P|$, irrespective of its side lengths. We leave the details
to the interested reader.


4. A GROMOV-TYPE THEOREM FOR FINITE GROUPS

In this section we prove Theorem 1.3 about the algebraic structure of almost flat Cayley graphs, together
with two variants of that statement, which we gather as follows.
Theorem 4.1 (Structure of almost flat finite groups). Let $\varepsilon,\delta > 0$. Let $G$ be a finite group generated by
a symmetric subset $S$ whose Cayley graph has diameter $\gamma := \mathrm{diam}_S(G)$ and is $\varepsilon$-almost flat, which is to
say that

$$\gamma \ge \left(\frac{|G|}{|S|}\right)^{\varepsilon}.$$

Then there is a constant $D$ depending only on $\varepsilon,\delta$ such that if $\gamma \ge D$ then

(1) G has a normal subgroup H contained in such that G/H has a nilpotent subgroup of index, 
rank and nilpotency class bounded in terms of e and 6 only; 

(2) G has a normal subgroup H contained in such that G/H has an abelian subgroup of index 

and rank bounded in terms of e and 6 only; 

(3) $G$ has a normal subgroup $G_0$ of index at most $C_\varepsilon$ that admits a cyclic quotient with diameter at
least $c_\varepsilon\gamma$, where $C_\varepsilon, c_\varepsilon$ are positive constants depending on $\varepsilon$ only.

In (3), the diameter of the cyclic quotient is understood with respect to a system of generators of $G_0$
derived from $S$ by the usual Reidemeister-Schreier process, which we summarise in the following lemma.

Lemma 4.2. Let G be a group generated by a finite symmetric set S, and let Gq be a subgroup of G of 
index d. Then Gq n contains a symmetric generating set Sq for Gq satisfying ^IS"! < \So\ < dlS"! 

and ^(diam 5 (G) — d) < diam 5 g(Go) < diam 5 (G). 

Proof. Let T be a set of right-coset representatives for Gq in G, and for x G G write r(x) for the unique 
element of T such that x G Got{x). It is shown in [26l Lemma 7.2.2] that the set Sq = {tsT{ts)~^ : s G 
S,t gT} is symmetric and generates Gq with diameter at most diam 5 (G). We may assume that d > 1, 
in which case S intersects at least two cosets of Gq . The argument of Lemma 12.71 therefore shows that we 
may take T C and so C Gq n and diams'(G) < 2 (idiam 5 (,(Go) -|- d, as desired. It is trivial 

that |5o| < d|5'|, and the fact that \Sq\ > follows because there must be a coset of Gq containing at 
least ^|5| elements of S. □ 

Remark 4.3. Lemma 4.2 implies that if the index of $G_0$ is bounded and $\mathrm{diam}_S(G)$ is sufficiently large
then $(G,S)$ is almost flat if and only if $(G_0,S_0)$ is almost flat.

Remark 4.4. Theorem 4.1 (3) can be seen as a finite analogue of the key step in Gromov's proof of
his polynomial growth theorem [23], in which he establishes that every finitely generated group with
polynomial growth has a finite-index subgroup that maps onto $\mathbb{Z}$.

Remark 4.5. Lee and Makarychev [35] show that if the Cayley graph of a finite group is uniformly doubling,
then a similar conclusion to that of Theorem 4.1 (3) holds. Their proof is very different and closer to
Kleiner's approach to Gromov's theorem via harmonic functions, and requires the doubling assumption
to hold at all scales. By contrast, our argument requires doubling only at one large scale (see Prop. I4.24p . 

Remark 4.6. The proof of Theorem 4.1 (2) shows, more generally, that for any integer $s \ge 1$ the group

G has a normal subgroup H contained in S’P” ^ such that G/H has an s-step nilpotent subgroup of 
index and rank bounded in terms of e and 5 only. 

We also prove the following result, which may be thought of as a converse to Theorem 4.1.

Proposition 4.7 (Nilpotent groups are almost flat). Let $G$ be a finite $s$-step nilpotent group generated
by a symmetric subset $S$. There is a positive constant $\varepsilon = \varepsilon(s,|S|)$ depending only on $s$ and $|S|$ such that
$\mathrm{diam}_S(G) \ge |G|^{\varepsilon}$.

Remark 4.8. The example of $\mathbb{F}_2^n$ with the standard generating set shows that the $\varepsilon$ in Proposition 4.7 has
to depend on $|S|$.

We start the proof of Theorem 4.1 by translating the almost flat assumption into a doubling property.

2 

Lemma 4.9 (Almost flat groups are doubling at some scale). Let e, 5 > 0 and set K = 5^ . Consider an 
e-almost flat Cayley graph of a finite group G with symmetric generating set S. Let 7 := diams'(G) be its 
diameter. Then there is an integer n satisfying 7 *^/^ < n < 7 *^ such that 

|^5n| < 

Proof. Let ko be the largest k such that 5 ^ 7 *^/^ < 57 *^, and note therefore that > 7 ”^/^. The almost 
flat assumption therefore implies that 

|^5'=oy/2| ^ 1^1 ^ |C'||57^/"|/|5| < 71 ^ 157^/^1 = |57^/^| < 52fco/fe|57'/=| = i^fco|57'/^|^ 

from which the conclusion follows easily. □ 

Proof of Theorem \4.1\ (1). Let K = 5^ and let no = no{K) be the integer given by Proposition 12.91 and 
take D large enough to force 7 *^/^ > no. Lemma 14.91 then implies that there exists n with no < n < 7 *^/^ 
such that < K\S'^\. The conclusion then follows from Proposition 12.91 □ 

We now pass to the proof of part (2) of Theorem 4.1. It relies on some structural results about finite
nilpotent groups, and in particular about commutators and nilprogressions; we refer the reader to Section
3 for background on these and definitions. We will also use the following standard fact, which follows
from [49, Proposition B.2], for example.

Lemma 4.10. Let $\alpha$ be a commutator form of weight $k$ in the sense of [49, Definition 3.2]. Then the
value of $\alpha(y_1,\ldots,y_k)$ mod $G_{k+1}$ is multilinear in the variables $y_1,\ldots,y_k$.

This means in particular that the value of $\alpha(y_1,\ldots,y_k)$ mod $G_{k+1}$ depends only on the values of $y_i$
mod $[G,G]$.
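In the simplest non-trivial case, weight $k = 2$ in a 2-step nilpotent group (where $G_3$ is trivial), multilinearity says that the commutator is bilinear, so that $[g^a,h^b] = [g,h]^{ab}$ and $[gk,h] = [g,h][k,h]$. The sketch below checks this in the integer Heisenberg group, with elements stored as triples $(a,b,c)$; the specific elements and the commutator convention $[g,h] = g^{-1}h^{-1}gh$ are assumptions made for the illustration.

```python
def mul(g, h):
    """Multiply elements (a, b, c) of the integer Heisenberg group."""
    return (g[0] + h[0], g[1] + h[1], g[2] + h[2] + g[0] * h[1])

def inv(g):
    """Inverse of (a, b, c)."""
    return (-g[0], -g[1], -g[2] + g[0] * g[1])

def power(g, n):
    """g^n for any integer n, by repeated multiplication."""
    base = g if n >= 0 else inv(g)
    out = (0, 0, 0)
    for _ in range(abs(n)):
        out = mul(out, base)
    return out

def comm(g, h):
    """[g, h] = g^{-1} h^{-1} g h."""
    return mul(mul(inv(g), inv(h)), mul(g, h))

g, h, k = (2, -1, 5), (1, 3, -4), (-3, 2, 7)

# Homogeneity in each variable: [g^a, h^b] = [g, h]^{ab}.
homogeneous = all(comm(power(g, a), power(h, b)) == power(comm(g, h), a * b)
                  for a in range(-4, 5) for b in range(-4, 5))

# Additivity in the first variable: [gk, h] = [g, h][k, h].
additive = comm(mul(g, k), h) == mul(comm(g, h), comm(k, h))
```

Both checks also confirm that the commutator depends only on the first two coordinates of each argument, that is, on the images of $g$ and $h$ modulo $[G,G]$.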

Lemma 4.11 (Commutators in finite nilpotent groups are close to the identity). Let $G$ be a finite $s$-step
nilpotent group generated by the nilprogression P = P*{xi,... ,Xr', L). Then the commutator subgroup 
[G,G] is contained in ^ where 7 := diamp(G). 

Proof. By induction on s, it is enough to prove that Gg, the last non-trivial term in the lower central 
series of G, is contained in fact, since Gg is an abelian group generated by the basic 

commutators of weight s in the Xi, of which there are at most 0 ,.^s(l), it suffices to prove that every power 
of each such commutator is contained in pW,s( 7 ^/^), However, Lemma 14.101 means that if a{xj.^, ...,Xjfl 
is a commutator of weight s in the Xi then a{xj^, ...,Xjfl^ = a(x”^,We have x”^ E P'^ by 
dehnition, and so there are ii,... ,ir such that j^ij < Lij and such that x” = x^^ • • • x(]’’ (mod [G, Gj), and 
hence that a(xj^,Xj 2 , = 01=1 ct{xi,Xj. 2 ^ ■■■^Xjfl^L again by Lemma [4. 101 It will therefore suffice to 

show that each a{xi, xj.^^ with |4| < belongs to pW.s( 7 ^/^)_ However, a{xi, xj^,..., E 

P(xi, ... ,Xr', [ 7 ^^*] L) by definition, and this last set is contained in pOr,s{fl “) j^y proof of Proposition 
13.101 p!|) . since in the notation of that proof we have B{L) C P*{L). The lemma is therefore proved. □ 

Remark 4.12. Essentially the same proof shows that the same result holds if P is a nilpotent progression
or nilcomplete progression. 

Proof of Theorem \4- 1\ (2). By part (1) applied to (5 = |, we may pass to a quotient and assume that G 
has a normal nilpotent subgroup E whose index, class and rank are bounded in terms of e only. Indeed, 
the ratio |G|/|S| does not increase when passing to a group quotient, and the diameter of the quotient 
will be at least 7 — 7 ^/^. Lemma 14.21 then gives a generating set Sq C for L such that diam 5 (r) is 

comparable to 7 and such that (L, Sq) is P-almost flat for some positive e' depending on e only. Arguing 
as in the proof of Theorem 14.11 (1), and using Lemma 12.21 we see that there exists n < 0 ^, 5 ( 7 ^) such 
that Sq is an , 5 (l)-approximate subgroup. By Theorem 13.91 therefore, and possibly after passing to a 

further quotient, there exists a nilprogression P of bounded rank such that Sq C P C Sq ’ , and hence 

that diamp(r) > Lemma 14.111 therefore implies and we conclude that [r,r] C and 

hence [r,r] C The subgroup H := [r,r] is characteristic in T, and hence normal in G, and 

the quotient G/H has the desired properties. □ 

Lemma 4.13. Let G be a finite nilpotent group of rank r and step s. Then 

\G/[G,G]\ > 

Proof. We denote by G = Go = Gi > G 2 > ... > > G^+i = {1} the lower central series of G. Observe 

that 

\Gk\ = ll\G,/G,+i\, (4.1) 

i=k 

that Gfc/Gfc_|_i is abelian and generated by the basic commutators in the Xi of total weight k (see Section 
[3] or [26]). Let xi,...,Xr be a generating set for G. Note that the group G/[G, G] is generated by the 
elements Xj[G, G]. Write d for the order of the element Xi[G, G] of maximum order in G/[G, G], and note 
that |G/[G, G]| > d. It follows from Lemma 14.101 that the order in the group G^/Gfe+i of a commutator 
of total weight k featuring generators xq,..., is at most d. Therefore, writing h{r, k) for the number 
of basic commutators of total weight k m. r generators we have 

|Gfc_i/Gfc| < 

In particular, writing c(r, s) for the number of basic commutators of total weight at most s in r generators, 
()4.Ih implies that |[G, G]| < The result follows. □ 

Proof of Proposition [23 Let 7 := diam 5 (G). Suppose first that G is abelian. Then |5”| < and hence 
7 > |G|^/I‘^L If G is nilpotent of class s, then Lemma [4. 131 shows that |G| < |G/[G, G]|‘^l^l’'* < 7 l'^l‘^isi,s as 
desired. □ 

We now pass to the proof of the part (3) of Theorem 14.11 For this we require the following fact about 
flat tori. 

Lemma 4.14 (Euclidean tori have large one-dimensional quotients). Let A be a discrete subgroup of rank 
n in M”' and consider the torus T := M”/A, endowed with the standard Euclidean metric. Then there is 
a one-dimensional quotient torus with diameter at least times the diameter ofT. 

Proof. Let $v_1, \ldots, v_n$ be a basis of successive minima of $\Lambda$, with $\|v_1\| \le \ldots \le \|v_n\|$. Let $H$ be the $\mathbb{R}$-span
of $v_1, \ldots, v_{n-1}$. Let $T_0$ be the quotient torus $(\mathbb{R}^n/H)/(\Lambda \cap H)$. The diameter of $T_0$ is the supremum over
all $x \in \mathbb{R}^n$ of the distance from $x$ to $H + \Lambda$. This is half the distance from $v_n$ to $H$. On the other hand
the diameter $\mathrm{diam}(T)$ of $T$ is the supremum $\sup_x \inf_{\lambda \in \Lambda} \|x - \lambda\|$, which is at most $\frac{1}{2}\sum_i \|v_i\| \le \frac{n}{2}\|v_n\|$.

On the other hand, it follows from Minkowski's second theorem that if $V_n$ denotes the volume of the
Euclidean $n$-ball,

$$\frac{2^n}{n!} \le V_n \, \frac{\|v_1\| \cdots \|v_n\|}{\|v_1 \wedge \ldots \wedge v_n\|} \le 2^n,$$

and similarly for $v_1 \wedge \ldots \wedge v_{n-1}$. Since $\|v_1 \wedge \ldots \wedge v_{n-1}\| \, d(v_n, H) = \|v_1 \wedge \ldots \wedge v_n\|$, we conclude that
$\|v_n\| \le 2(n-1)! \, d(v_n, H) \, V_{n-1}/V_n$. On the other hand, it is well known that $V_{n-1}/V_n \le n/\pi$, and so the
result follows. □


Proof of Theorem \4-l\ (3). Pick 5 = 1/6. By part (2) of the theorem there is a normal subgroup H 
contained in ^ such that G/H has an abelian subgroup of rank and index bounded in terms of 

e only. Applying Lemma [4.21 we may pass to a subgroup of bounded index and assume that G/H is 
abelian of bounded rank. Lemma 14.91 implies that, provided 7 is larger than some constant depending 
on e only, there is some ni < 7 ^/^ such that is an 0£(l)-approximate subgroup. Its image 5^"^^ 
in G/H is therefore also an Oe(l)-approximate group, and so there is a subgroup Hq G S and a 
generalised arithmetic progression P = P{vi, ...,Vr',L) of bounded rank such that C HqP C 
|22j . On replacing H by the pullback of Hq to G, we may in fact assume that Hq is trivial. Note that if 
7 = diam 5 (G) is larger than a constant depending on £ only, 

2ni diamp(G/Lf) < diam^(G/Lf) < Oe{nidiamp{G/H)). 

The same comparison holds for the diameter of every group quotient. The generators vi, ...,Vr allow us to 
write G/H as Z'’/A for some lattice A; thus, we write x G G/H in coordinate form as x = (xi,... ,Xr). 
Since G/H is abelian, P^ = P{vi, ...,Vr‘,nL) for every n. Let || • || be the norm on M’’ defined by 
||x|| := maX]' \xi/Li\, and define £{x) := mayd[\\xi/LiW. Note that ||x|| < £{x) < ||x|| + 1 and that 

diam.p{G/H) = sup inf £{x — 6). 

We may compare this diameter with the diameter of the Euclidean torus T := M'’/Ap, where Ap is the 
lattice of all vectors of the form (xi/Li, ...,Xr/Lr) with (xi, ...,Xr) G A. Indeed, 

diamp{G/H) < 1 + sup inf ||x — (I|| < 2 + sup inf ||x — 5||, 

<5gA xeM® 

where we have used that infj^ggr ||x — y|| < 1 for all x G because Li > 1. Hence 

diamp(G/H) < 2 + sup inf ||y — (I||oo < 2 + sup inf ||y — (I ||2 = 2 + diam(M'’/Ap), 

ygRr SGAl j/gRr 

where || • II2 is the standard Euclidean norm on Lemma l4. 141 implies that there is a hyperplane Hl 'vo. 

intersecting Ap in a lattice such that the diameter of the one-dimensional torus quotient W/{/Hl + A.i) 
satisfies 

diam(M^/Ap) < -i diam(]R'’/(^p-|-Ap)). 

vr 

Moreover, 

diam(M''/(^p-I-Ap)) = sup inf ||y — 5||2 < a/t sup inf H?/— 5||oo = aA" sup inf ||x —(I||, 

ygRr (Se'KL+Ai ygRr (5 GHl+Ai, 3;gRr (SG'K+A 

where H = {(Lixi,..., L^Xr)', (xi,..., Xr) G Hl}- Writing H-e -.= Hr\ Z^, we then have 

sup inf llx — 511 < 1 -b sup inf llx — 511 < 1 -b sup inf Hx — 5). 

xGR® '5g'H+A a-ggr <5 g'Rz+A x&IX <5eWz+A 
However, — 6 ) is precisely the diameter of the cyclic quotient Q := I/l{%i + A)

with respect to the image of P, and so, combining all of the above, we obtain 

diam^(G/P) < ©^(ni diamp(G/P)) < 0£(ni diamp(Q)) < 0£(diam^((5)). 

□ 


This ends the proof of Theorem 4.1.

We now record the following simple consequence of Part (1) of Theorem 4.1.

Corollary 4.15 (Almost flatness is a group property). Let $\varepsilon > 0$ and suppose that $G$ is a finite group
with a symmetric generating set $S$ with respect to which $G$ is $\varepsilon$-almost flat. Suppose that $\Sigma$ is some other
symmetric generating set for $G$. Then there is $\varepsilon' = \varepsilon'(\varepsilon, |\Sigma|) > 0$ and $c > 0$ depending on $\varepsilon$ only such
that $\mathrm{diam}_\Sigma(G) \ge c|G|^{\varepsilon'}$.

Proof. By Theorem 14.11 G has a quotient Q of comparable diameter containing a nilpotent subgroup of 
bounded rank, index and nilpotency class, and so the conclusion follows from Proposition 14.71 and Lemma 

IMl □ 

Remark 4.16. We caution that, in spite of Corollary 14.151 for every e > 0 there exists C = Gg such that 
every finite group G satisfying |G| > C has at least one Cayley graph that is e-almost flat and at least one 
Cayley graph that is not e-almost flat. Indeed, taking S = for some x G G gives an almost 

flat Cayley graph. On the other hand, as is well known and easy to prove, every finite group G of size at 
most 2” has a generating set of size at most n and diameter at most n. 

4.1. Explicit bounds, the polynomial Freiman-Ruzsa conjecture and an example. The only
source of ineffectiveness in our arguments comes from Theorem 2.3, the structural result for approximate
subgroups of arbitrary groups proved in [10]. Any explicit bound in terms of $K$ on the number of translates
of $\Gamma$ needed to cover the $K$-approximate subgroup $A$ would yield an explicit bound on $n_0(K)$ in Theorem
1.1 and on the index of the abelian subgroup in Theorem 1.3. In this subsection we present an example
to show that

(1) Theorem 1.3 is sharp in the sense that the power $\varepsilon$ cannot be allowed to tend to zero without
losing the conclusion that the index of the abelian subgroup remains bounded; and

(2) the bound on the number of translates required in Theorem 2.3, and hence the quantity $n_0(K)$ in
Theorem 1.1, is worse than polynomial in $K$.

We give more precise versions of these statements below as Facts 4.17 and 4.18, as well as giving more
detail on the context in which they should be viewed. However, first we establish some of the notation of
our example.

Let $n$ be an even integer $\ge 8$, and let $p = p(n)$ be a prime such that $p > n$. We define groups
$L_n \ge G'_n \ge G_n$ as follows. First, take

$$L_n = \mathrm{Sym}(n) \ltimes \mathbb{F}_p^n,$$

with the semidirect product defined by the natural action of $\mathrm{Sym}(n)$ on the basis vectors, and write
$\pi : L_n \to \mathrm{Sym}(n)$ for the natural projection. Writing $V$ for the subspace of $\mathbb{F}_p^n$ consisting of those vectors
whose coordinates sum to zero, define the subgroup $G'_n$ of $L_n$ by

$$G'_n = \mathrm{Sym}(n) \ltimes V.$$

Finally, define the subgroup $G_n$ of index 2 in $G'_n$ by

$$G_n = \mathrm{Alt}(n) \ltimes V.$$

Let S be the generating set of consisting of the identity and the three elements (c;0), (t; 0), 
(1; 1,0,... , 0) and their inverses. Here c is the long cycle (12... n) and r is the transposition (12). It 
is clear that Ln is the direct product of G'^ with the central subgroup Z = made of elements of 

the form (l,u) in which all coordinates of v are equal, and so we may set S' to be the image of S under 
projection to G'^. Finally, let S be the generating set for Gn obtained from S' using Lemmaand note 
that |S| < 8 . 

The reason for making these definitions is that the groups they describe satisfy the following facts. 

Fact 4.17. For every function / : N —(0,+oo) going to zero at infinity there exist choices of p{n) such 
that diam 5 (Gn) > but such that no subgroup of Gn of index < n/2 admits a non-trivial abelian 

quotient. 


Fact 4.18. For every function / : N ^ N, there are a choice of p{n), an increasing sequence of integers 
Kn, and Kn-approximate subgroups An in Ln sueh that, for large enough n, the approximate subgroup 

An is not contained in fewer than Kn°° cosets of any subgroup F < such that F has a normal 

subgroup H contained in An^^"^ with T/Fl solvable. 


Here we have denoted by log^^^ the 4th iteration of log in base e. 

Fact 14.171 establishes statement (1) above, while Fact 14.181 corresponds to statement (2). Fact 14.181 
should be compared with the classical case of approximate subgroups of abelian groups, where it is 
expected that only translates of some coset-progression are needed to cover the approximate 

subgroup (this is the so-called polynomial Freiman-Ruzsa type conjecture, see [ 2 T] and references therein). 
In the abelian case bounds of the form have been established by Sanders |44j . 


Remark. After a first draft of this paper circulated, Eberhard m adapted the construction of Fact 14.181 
to show that for arbitrarily large K there exists a group G with a finite iL-approximate subgroup that is 
not covered by cosets of any subgroup F having a finite normal subgroup H with T/H solvable. 

This nicely improves the bound in our Fact 4.18. Note that the example in m generates an infinite
group, while we work here in the setting of finite groups. 


Before we prove Facts ITT71 and ITTHl let us isolate a few helpful statements about the groups L„, G'n 
and Gn- We start by bounding their diameters, which we denote from now on by 

7 = diam^(L„), 7 '= diam 5 '/(G(,), 70 = diam 5 (G„). 


Lemma 4.19. We have 

( 1 ) ^{p - 1 ) < 7 < np -b O(n^); 

( 2 ) — 1 ) < 7 ' < np -b 0 {n^); 

(3) <lQ<np + 0{n^). 

Proof. Item (3) follows from item (2) by Lemma 4.2. The upper bound of (2) trivially follows from the
upper bound of (1). To prove the lower bound of (1) note that the elements of $S^R$ are all in $\mathrm{Sym}(n) \ltimes [-R, R]^n$,
and hence that $(2\gamma + 1)^n \ge p^n$. To prove the lower bound of (2), note that the projections of
these elements to $G'_n$ lie in $\mathrm{Sym}(n) \ltimes B_R$ for some subset $B_R$ of $V$ with size at most $(2R + 1)^n$, and hence
that $(2\gamma' + 1)^n \ge p^{n-1}$. For the upper bound of (1), note that $O(n^2)$ steps are enough to cover $\mathrm{Sym}(n)$
(see [33], for example), while $p$ steps are enough to obtain the line $(1; \mathbb{F}_p, 0, \ldots, 0)$ and thus (conjugating
by powers of $c$) we obtain each coordinate line in $p + n$ steps and thus all of $\mathbb{F}_p^n$ in $n(p + n)$ steps. Hence
the diameter of $L_n$ with respect to $S$ is at most $np + O(n^2)$. □
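
The counting argument behind the lower bound in item (1) can be checked by brute force on a toy instance. The following Python sketch is only an illustration under stated assumptions: the parameters $n = 3$, $p = 5$ lie outside the range $n \ge 8$ used in the text (the bound $2\gamma + 1 \ge p$ does not depend on that restriction), and the function names are ours rather than the paper's.

```python
from collections import deque
from math import factorial


def compose(s, t):
    """Compose permutations stored as tuples: (s∘t)(i) = s[t[i]]."""
    return tuple(s[t[i]] for i in range(len(s)))


def act(s, w):
    """Permute the coordinates of w: the entry in position i moves to position s[i]."""
    out = [0] * len(w)
    for i, x in enumerate(w):
        out[s[i]] = x
    return tuple(out)


def multiply(g, h, p):
    """Product (s; v)(t; w) = (s∘t; v + s·w) in Sym(n) ⋉ F_p^n."""
    (s, v), (t, w) = g, h
    sw = act(s, w)
    return compose(s, t), tuple((a + b) % p for a, b in zip(v, sw))


def cayley_diameter(n, p):
    """Breadth-first search from the identity.

    Returns (diameter, number of group elements reached).  The identity
    generator is omitted because it does not change any distance.
    """
    ident, zero = tuple(range(n)), (0,) * n
    cycle = tuple((i + 1) % n for i in range(n))        # the long cycle c
    cycle_inv = tuple((i - 1) % n for i in range(n))    # its inverse
    swap = (1, 0) + tuple(range(2, n))                  # the transposition (12)
    e1 = (1,) + (0,) * (n - 1)
    minus_e1 = (p - 1,) + (0,) * (n - 1)
    gens = [(cycle, zero), (cycle_inv, zero), (swap, zero),
            (ident, e1), (ident, minus_e1)]
    start = (ident, zero)
    dist = {start: 0}
    queue = deque([start])
    while queue:
        g = queue.popleft()
        for s in gens:
            h = multiply(g, s, p)
            if h not in dist:
                dist[h] = dist[g] + 1
                queue.append(h)
    return max(dist.values()), len(dist)


diam, order = cayley_diameter(3, 5)
# The search should reach all n! * p^n elements, and 2*diam + 1 >= p should hold.
print(diam, order == factorial(3) * 5 ** 3, 2 * diam + 1 >= 5)
```

The search is exhaustive, so it is only practical for very small $n$ and $p$; it says nothing about the constant hidden in the $O(n^2)$ term of the upper bound.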

Lemma 4.20. The only subgroups of $G'_n$ of index $< n$ are $G'_n$ itself and $G_n$.

Proof. It is well known that the only proper subgroup of $\mathrm{Sym}(n)$ with index $< n$ is $\mathrm{Alt}(n)$, so if $\Gamma$ is a
subgroup of $G'_n$ with index $< n$ then $\pi(\Gamma)$ contains $\mathrm{Alt}(n)$. Since $p > n$, the projection $\pi$ is not injective in
restriction to $\Gamma$. However, $\ker \pi \cap \Gamma$ is invariant under the action of $\mathrm{Alt}(n)$. It is well known that $\mathrm{Sym}(n)$
and $\mathrm{Alt}(n)$ act irreducibly on $V$, and so this implies that $\ker \pi \cap \Gamma = V$, and hence that $\mathrm{Alt}(n) \ltimes V \subset \Gamma$. □

Proof of Fact Since \Gn\ < re!p"'“^ < (rep)”“^ < Lemma [4.191 implies that 70 3> \Gn\^ ■ In 

light of Lemma 14.201 therefore. Fact 14.171 is satisfied upon setting p = p(re) sufficiently large. □ 


We now pass to the proof of Fact 4.18. We use the following lemma.

Lemma 4 . 21 . Let /3 G (0,1) and suppose that (loglogre)“^ < d < 1. Write t = d^/^ and h = 9/t^. 
Then, provided re > n{j5), every subgroup H < Sym[n) of order at least dTn\ has a subgroup Hq with 
[p[ : Pfo] < b that stabilises a subset Hq C {1,... ,re} of size at least ^re on which it acts as Sym{PLf) or 
Aitiyio). 

We prove Lemma 4.21 shortly. First, let us use it to prove Fact 4.18.


4n 


and 6 = ^, and 


Proof of Fact 4GS\ By Lemma 14.191 we have p^ ™ < 27 + 1 < 0{np). Setting e = 

taking Kn = 5^ = 5^®"’, Lemmas 14.91 and 12.21 therefore give a scale k G 7 ^^^] such that An := 
is a i^n-approximate subgroup. 

Assume that there are subgroups H < F in L„, such that H is normal in F, contained in 
T/H \s solvable, and is contained in fewer than 7 ^" log log log (;.Qggf;g gf p for some fixed a > 0 

to be determined shortly. We may choose p large enough that < 7 ^/^ < 2k, which by 

Lemma 1^71 implies that F has index at most ip" in L„,. Now set B = 5^®, so that Kn = H", 

and set a := 


1 


3 log B 


Note that a > 


If re is large enough, we then see that the index log log-g bounded above by d~^, where 

d = (loglogre)“ 2 . We may thus apply Lemma 14.211 to the subgroup 7 r(F) < Sym{n), because its index 
is at most d~'^ with d = (loglogre)“ 2 . We get a subgroup Fq < F of index at most b such that Fq fixes 
a subset Hq of {1,... ,re} of size at least ^re, where it acts as Sym{Llo) or Alt{Llo). Note that b < log re, 
and that ^re is at least (i^gnp ’ when re is large. Let Ho = H D Fq, and the natural projection from 
Sym{Llo) x Fp° to Sym{Llo). There is a natural restriction homomorphism from Fq to Sym{Llo) x Fp°, 
which consists in forgetting the entries outside of Hq. Let Fq, Hq be the images of Fq, Hq under this map. 

Now observe that if re is large enough Fq must contain Alt{Qo) ^ where Vhp is the subspace of Fp“ 
of vectors whose coordinates sum to 0. Indeed, to see this it is enough to show that IFqI > p|Ho|!, because 
then TToo will not be injective on Fq and there will be an element in the kernel whose coordinates are not all 
equal, so that the span of its orbit under Alt{Llo) will contain Vhp. On the other hand if IFqI < p|Ho|!, then 
iFol < p\no\lp^-\^°\in - |Ho|)! < re!pV“'^°', so that p^re! = |L„| < fe|ro|[L„ : F] < 6 re!p 
which implies that if n is large enough. But Iflol ^ n/(logn)^ and p > , so this is

impossible unless n is bounded. 

The group 7rQf^{Ho) is a normal subgroup of Sym{Qo). Hence it is either trivial, or must contain 
A/t(Ho)- If it contains Alt{Qo), then, since Hq is normal in Tq, and Tq contains Yuo, Hq must contain 
[Vhg, A/t(no)] = liio) hence Hq contains Alt{^lo) k Vqq. However we observed in the proof of Lemma 
14.191 the vector coordinates in are contained in the interval [—i?, i?]. Since Hq lies in we 

must have p < 2kf{Kn) = 0{f{Kn)yf7vp), which we may rule out by choosing p large enough in terms of 
n. It follows that t:q,q{Hq) is trivial. But this contradicts the fact T^Q^f^iVQ)/ t:q_q{Hq) is solvable (being a 
homomorphic image of Tq/Hq), because AU^^Iq) is not solvable if n is large enough. This contradiction 
ends the proof of Fact 14.181 □ 


The proof of Lemma 4.21 requires the following two preliminary steps.

Lemma 4.22. There is no G N such that if d ^ [0,1], n > no, and H < Sym{n) is a subgroup of order 
at least dAnl, then H has an orbit of size at least |n. 

Proof. Let ni ,... ,ns be the size of the orbits of H. We have H < Sym{ni) x ... x Syrnfris), and hence 
\H\ < nil...nsl < n”^...n"®. If all the n^ are at most ^n, then we get \H\ < and hence 

n! < (^)"', which implies that n is bounded. □ 

Lemma 4.23. There is tuq G N sueh that i/t G [0,1], m > mo, and H < Sym{m) is a transitive subgroup 
of order at least then the maximal number of imprimitivity blocks of H is at most {3/t)‘^. 

Proof. Let a be the maximal possible number of blocks of imprimitivity. If H is not primitive, then a is 
a proper divisor of m and thus a < ^. Moreover H is contained in the wreath product of Sym{a) with 
5?/m(^), so that \H\ < a!((^)!)“. Hence 

But (y)™ < m! as soon as m is large enough. So this yields t”* < and the result follows. □ 


Proof of Lemma 4-^l\ By Lemma 14.221 H has an orbit Ll of size m > |n. The action of LI on H may 
not be primitive. Let Hq be a minimal block of imprimitivity, and Hq the stabiliser of Hq in H. Let K 
be the restriction of LI to H and Kq the restriction of Hq to LIq. Note that [Sym{Ll) : K] < [Sym{n) : 
H] < < f™. We may thus apply Lemma 14.231 to K and get that the maximal number a of blocks of 

imprimitivity is most b = 9t~^. In particular [H : Hq] < b. Similarly [Sym{QQ) : Kq] < [Sym{n) : Hq] < 
bd-^ < bt-^ < t-^^, and so liLol > 

On the other hand, since the action of Kq on LIq is primitive, Bochert’s theorem |16l Theorem 3.3B] 


{m/ay. 


Combining 


implies that unless Kq acts as Sym{LlQ) or Alt{LlQ) on flo it satisfies |iLo| < {[{m/a+i)/2 ])\ ' 
these two inequalities yields {mf thus m < 4at“^“, and hence n < This implies 

that n is bounded in terms of /3 if d > l/(log log n)^. □ 


















4.2. Doubling close to the diameter and a question. We end this section by asking a question. 
Lemma 4.9 shows that the almost flat condition implies doubling at some controlled scale. Moreover,
doubling at some scale is enough to apply Theorem 12.,41 yielding the following. 


Proposition 4.24. Consider a finite group G with symmetric generating set S. Let K > 1. There is an 
integer ni = ni{K) depending only on K such that if n is such that n > ni and 

$$|S^{2n+1}| \le K|S^n|$$

then S'®” contains a normal subgroup H of G such that G/H has a nilpotent subgroup whose index, rank 
and nilpotency class are bounded in terms of K only. 


Proof. By Lemma 2.10, $|S^{5n}| \le C(K)|S^n|$ for some constant $C(K)$ depending on $K$ only. Now apply
Proposition 2.9. □


Note that the quotient G/H is non-trivial whenever n < ^ 7 , where 7 is the diameter of G with respect 
to S. 


Question: Assume that |S'^”+^| < K\S^\ holds for some n G [^ 7 , (1 — 6 ) 7 ], say. Is it true that G has a 
normal subgroup H G such that G/H has a nilpotent subgroup whose index, rank and nilpotency class 
are bounded in terms of K, e only? 


5. Cheeger constant and spectral gap

In this section we prove Corollaries 1.5, 1.6, 1.7 and 1.8. We first recall some definitions.

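Let $\mathcal{G}$ be a finite connected $k$-regular graph. Given a set $A \subset V(\mathcal{G})$, denote by $\partial A$ the edge boundary of
$A$, defined as the set of edges between $A$ and $V(\mathcal{G}) \setminus A$. The Cheeger constant $h(\mathcal{G})$ of a graph $\mathcal{G}$ is defined
by

$$h(\mathcal{G}) = \inf\left\{ \frac{|\partial A|}{|A|} : A \subset V(\mathcal{G}),\ |A| \le \tfrac{1}{2}|\mathcal{G}| \right\}.$$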

Recall that $\mathcal{G}$ is called an $\varepsilon$-expander if $h(\mathcal{G}) \ge \varepsilon$; see the survey [25] for detailed background. Expansion
may be considered in terms of the spectrum of the Laplacian operator on the graph. Specifically, the
Laplacian $\Delta$ on our connected $k$-regular graph $\mathcal{G}$ is an operator on the finite-dimensional vector space
$\ell^2(\mathcal{G})$, and is defined by

$$\Delta f(x) = k f(x) - \sum_{y \sim x} f(y),$$

where $y \sim x$ means that $y$ and $x$ are connected by an edge in the graph. The Laplacian is self-adjoint on
$\ell^2(\mathcal{G})$, and so its eigenvalues

$$\lambda_0(\mathcal{G}) = 0 < \lambda_1(\mathcal{G}) \le \ldots \le \lambda_{|\mathcal{G}|-1}(\mathcal{G})$$

are real. The link between this and expansion as defined above is given by the discrete Cheeger-Buser
inequality, which states that

\h{Gf <\i{G)<2k-h{G). (5.1) 

See the book m for a proof of these inequalities and the references therein for further historical background.
Furthermore, we record the following well-known additional inequalities, which together with
(5.1) imply (1.3).








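Lemma 5.1 (Diameter versus expansion). For every finite $k$-regular graph $\mathcal{G}$ we have

$$h(\mathcal{G}) \le \frac{4k \log |\mathcal{G}|}{\mathrm{diam}(\mathcal{G})}.$$

If $\mathcal{G}$ is vertex transitive then

$$\frac{1}{2\,\mathrm{diam}(\mathcal{G})} \le h(\mathcal{G}).$$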

Lemma 5.1 is both standard (see e.g. m for a variant) and straightforward, but for completeness we
present a short proof. Just as the proof of the Cheeger-Buser inequalities, this is based on the variational
characterisation of $\lambda_1$ and $h$, which gives

$$\lambda_1(\mathcal{G}) = \inf\left\{ \frac{\|\nabla f\|_2^2}{\|f\|_2^2} : \sum_{x} f(x) = 0 \right\} \qquad (5.2)$$

$$h(\mathcal{G}) = \inf_{f} \frac{\|\nabla f\|_1}{\inf_{m \in \mathbb{R}} \|f - m\|_1}. \qquad (5.3)$$

Here the infimums are taken over all real valued functions on the set of vertices of the graph. The norm
$\|f\|_2$ is the $\ell^2$ norm, while $\|f\|_1$ denotes the $\ell^1$ norm, defined as usual by $\|f\|_1 = \sum_{x \in V(\mathcal{G})} |f(x)|$. Finally,
$\nabla f$ is the non-negative function defined on the set of edges by $\nabla f(e) = |f(e^+) - f(e^-)|$, where $e^+$ and
$e^-$ are the vertices of the edge $e$. The identity (5.2) may be found in [37]. Since we do not have a readily
available reference for (5.3), we provide a proof below. First let us use it to prove Lemma 5.1.


Proof of Lemma 5.1. Abbreviate $h = h(\mathcal{G})$, the Cheeger constant of $\mathcal{G}$. Given $x \in \mathcal{G}$ and $r \ge 0$,
write $B_r(x)$ for the closed ball of radius $r$ centred at $x$ for the graph metric on $\mathcal{G}$, and write $S_r(x) =
B_r(x) \setminus B_{r-1}(x)$ for the sphere of radius $r$ centred at $x$.

By definition of $h$, if $|B_r(x)| \le \frac{1}{2}|\mathcal{G}|$ then we have $|\partial B_r(x)| \ge h|B_r(x)|$. Since each vertex of $S_{r+1}(x)$ is
incident with at most $k$ edges in $\partial B_r(x)$ we have $|S_{r+1}(x)| \ge k^{-1}|\partial B_r(x)|$, which implies that

$$|B_{r+1}(x)| \ge \left(1 + \frac{h}{k}\right)|B_r(x)|.$$

Noting that $\frac{h}{k} \le 1$, and hence that $1 + \frac{h}{k} \ge e^{h/2k}$, it follows immediately that $|B_r(x)| \ge \min\{\frac{1}{2}|\mathcal{G}|, e^{rh/2k}\}$.
In particular, if $\frac{rh}{k} \ge 2 \log |\mathcal{G}|$ then the balls $B_r(x)$ and $B_r(y)$ have size at least $\frac{1}{2}|\mathcal{G}|$, and hence non-trivial
intersection for every choice of $x$ and $y$. This proves the first inequality of the lemma.

Now assume that $\mathcal{G}$ is vertex transitive. Set $\gamma := \mathrm{diam}(\mathcal{G})$, and write $G = \mathrm{Aut}(\mathcal{G})$, the (finite) group of
automorphisms of the graph. Pick a base vertex $x_0$ in the graph and connect every other vertex $y$ to $x_0$
by a geodesic path, say $x_0 = y_0, y_1, \ldots, y_n = y$, noting that $n \le \gamma$. For any real-valued function $f$ on the
vertices of $\mathcal{G}$ we have

$$\sum_{\sigma \in G} |f(\sigma(x_0)) - f(\sigma(y))| \le \sum_{i=1}^{n} \sum_{\sigma \in G} |f(\sigma(y_{i-1})) - f(\sigma(y_i))| \le \|\nabla f\|_1 \sum_{i=1}^{n} |G_{e_i}|,$$

where $G_{e_i}$ is the stabiliser of the edge $e_i$ between $y_{i-1}$ and $y_i$. Clearly $G_{e_i}$ has a subgroup of index at
most 2 that fixes $y_i$, and hence $|G_{e_i}| \le 2|G|/|\mathcal{G}|$. Summing over all vertices $y$ of $\mathcal{G}$, it follows that

$$\inf_{m} \|f - m\|_1 \le \frac{1}{|G|} \sum_{\sigma \in G} \|f - f(\sigma(x_0))\|_1 = \frac{1}{|G|} \sum_{\sigma \in G} \sum_{y \in \mathcal{G}} |f(\sigma(x_0)) - f(\sigma(y))| \le 2\gamma \|\nabla f\|_1,$$

and so the lemma follows from (5.3). □

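Proof of (5.3). To see that $h(\mathcal{G})$ dominates the right hand side, we simply pick the function $f := 1_A - 1_{\mathcal{G} \setminus A}$,
where $A$ is a subset of vertices of size at most $|\mathcal{G}|/2$ realising the infimum in the definition of $h(\mathcal{G})$. Then
note that $\|\nabla f\|_1 = 2|\partial A|$ and that $\|f - m\|_1$ is minimised by taking $m = -1$, which makes its value equal
to $2|A|$. Now to verify the opposite inequality, given a function $f$ on the vertices consider the subsets of
vertices $\Omega_t := \{x : f(x) > t\}$ and let $m := \inf\{t \in \mathbb{R} : |\Omega_t| \le |\mathcal{G}|/2\}$. Note that if $t \ge m$, then $|\Omega_t| \le |\mathcal{G}|/2$,
while if $t < m$, then $|\mathcal{G} \setminus \Omega_t| \le |\mathcal{G}|/2$. On the other hand, $\|f - m\|_1 = \sum_x [(f - m)^+(x) + (m - f)^+(x)]$,
where $u^+ = \max\{0, u\}$ is the positive part of the real number $u$. Hence

$$\|f - m\|_1 = \int_{-\infty}^{m} |\mathcal{G} \setminus \Omega_t| \, dt + \int_{m}^{+\infty} |\Omega_t| \, dt \le \frac{1}{h(\mathcal{G})} \int_{-\infty}^{+\infty} |\partial \Omega_t| \, dt,$$

since $\partial \Omega_t = \partial(\mathcal{G} \setminus \Omega_t)$. On the other hand, observe that for every edge $e$ we have $\nabla f(e) = \int 1_{\partial \Omega_t}(e) \, dt$,
and so $\|\nabla f\|_1 = \int |\partial \Omega_t| \, dt$. The inequality follows. □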
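
The two inequalities of Lemma 5.1, read as $1/(2\,\mathrm{diam}) \le h \le 4k \log|\mathcal{G}| / \mathrm{diam}$ for vertex-transitive $k$-regular graphs, can be sanity-checked on small examples. The sketch below is illustrative only: it computes the Cheeger constant by exhaustive search over vertex subsets, uses simple graphs without loops, and all function names are our own.

```python
from collections import deque
from itertools import combinations
from math import log


def cycle_graph(n):
    """Adjacency lists of the n-cycle (2-regular, vertex transitive)."""
    return {v: [(v - 1) % n, (v + 1) % n] for v in range(n)}


def cube_graph(d):
    """Adjacency lists of the d-dimensional hypercube (d-regular, vertex transitive)."""
    return {v: [v ^ (1 << i) for i in range(d)] for v in range(2 ** d)}


def diameter(adj):
    """Graph diameter via breadth-first search from every vertex."""
    best = 0
    for root in adj:
        dist = {root: 0}
        queue = deque([root])
        while queue:
            u = queue.popleft()
            for w in adj[u]:
                if w not in dist:
                    dist[w] = dist[u] + 1
                    queue.append(w)
        best = max(best, max(dist.values()))
    return best


def cheeger_constant(adj):
    """min |boundary(A)| / |A| over non-empty A with |A| <= |V|/2 (exhaustive)."""
    verts = list(adj)
    best = float("inf")
    for size in range(1, len(verts) // 2 + 1):
        for subset in combinations(verts, size):
            inside = set(subset)
            boundary = sum(1 for u in inside for w in adj[u] if w not in inside)
            best = min(best, boundary / size)
    return best


def satisfies_lemma_5_1(adj):
    """Check 1/(2 diam) <= h <= 4 k log|G| / diam for a regular graph."""
    k = len(next(iter(adj.values())))
    diam = diameter(adj)
    h = cheeger_constant(adj)
    return 1 / (2 * diam) <= h <= 4 * k * log(len(adj)) / diam


print(cheeger_constant(cycle_graph(8)), satisfies_lemma_5_1(cycle_graph(8)))
print(cheeger_constant(cube_graph(3)), satisfies_lemma_5_1(cube_graph(3)))
```

The exhaustive search is exponential in the number of vertices, so this is suitable only for graphs with roughly twenty vertices or fewer.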

We now pass to the proofs of Corollaries 1.5 and 1.6.

Proof of Corollary 11.51 We have 7 ^ > by the left hand side of (II.3p . and so if Ai < 2“® then 7 > 8 , 
and hence 7 ^ > 1/Ai. On the other hand, 7 > l/2h by Lemma l5.ll so if /i < | then 7 > 2, and hence 
72 > 1/h. □ 

To prove Corollary 1.6 we use the following general lemma.


Lemma 5.2. Let $\mathcal{G}$ be the Cayley graph of a finite group $G$ with symmetric generating set $S$ and diameter
$\gamma$. Then

$$\lambda_1(\mathcal{G}) \le \frac{9|S|}{\gamma^2} \cdot \frac{|G|}{|S^{\gamma/3}|}.$$

Proof. Pick $a, b \in G$ such that $d(a, b) = \gamma$, and let $f(x) := d(x, a) - d(x, b)$, where $d(x, y)$ is the graph
distance in $\mathcal{G}$. Note that $f$ has zero mean and that $|\nabla f(e)| \le 2$ for every edge $e$. Hence $\|\nabla f\|_2^2 \le 2|S||G|$.
On the other hand $|f(x)| \ge \gamma/3$ on the balls of radius $\gamma/3$ around $a$ and around $b$. Hence $\|f\|_2^2 \ge
2(\gamma/3)^2 |S^{\gamma/3}|$. The result therefore follows from (5.2). □
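
As a numerical illustration of Lemma 5.2, read as $\lambda_1 \le 9|S||G| / (\gamma^2 |S^{\gamma/3}|)$, one can take $G = \mathbb{Z}/n\mathbb{Z}$, where the Laplacian eigenvalues are given by characters: $\lambda(j) = \sum_{s \in S} (1 - \cos(2\pi j s/n))$. The sketch below makes two assumptions of ours: the radius $\gamma/3$ is rounded down to an integer (which is what the proof needs), and the helper names are hypothetical.

```python
from math import cos, pi


def ball_sizes(n, steps):
    """Return (S, [|S^0|, |S^1|, ..., |S^gamma|]) for S = {0} ∪ {±s : s in steps} in Z/n."""
    gens = {0} | {s % n for s in steps} | {-s % n for s in steps}
    ball, sizes = {0}, [1]
    while len(ball) < n:
        nxt = {(x + s) % n for x in ball for s in gens}
        if len(nxt) == len(ball):
            raise ValueError("the given steps do not generate Z/n")
        ball = nxt
        sizes.append(len(ball))
    return gens, sizes


def lemma_5_2_sides(n, steps):
    """Return (lambda_1, right-hand side of Lemma 5.2) for the Cayley graph of Z/n."""
    gens, sizes = ball_sizes(n, steps)
    gamma = len(sizes) - 1                      # diameter of the Cayley graph
    # Smallest non-zero Laplacian eigenvalue, computed from the characters of Z/n.
    lam1 = min(sum(1 - cos(2 * pi * j * s / n) for s in gens) for j in range(1, n))
    # Assumption: the ball of radius gamma/3 is taken with integer radius floor(gamma/3).
    bound = 9 * len(gens) * n / (gamma ** 2 * sizes[gamma // 3])
    return lam1, bound


for n, steps in [(30, (1,)), (31, (1, 5)), (24, (2, 3))]:
    lam1, bound = lemma_5_2_sides(n, steps)
    print(n, steps, round(lam1, 4), round(bound, 4), lam1 <= bound)
```

For the cycle with $S = \{0, \pm 1\}$ both sides are of order $1/\gamma^2$, which is the regime the lemma is designed for.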


Proof of Corollary 11.61 Set K = 5^/^, and let uq = no{K) be the constant appearing in the conclusion of 
Theorem 11.11 Provided 7 > Uq, Lemma 14.91 implies that there is n satisfying uq < n < 7^/2 such that 
|^ 5 n| ^ \S^\. Theorem 11.11 therefore implies that |G| = \S'^\ < Oi^(l)|S'3'/^|. The bound (|1.5I) therefore 
follows from Lemma 15.21 and the bounds (11.411 then follow from (|1.3p and (|1.5p . □ 


Remark 5.3 (Cheeger constant versus $\lambda_1$). We have shown that almost flat groups satisfy the strong form
of the Buser inequality; that is to say $\lambda_1$ is comparable to the square of the Cheeger constant. This
property does not characterise almost flatness. Indeed it is shared for example by the Cayley graph of the
lamplighter group based on a finite cyclic group (see m), which is by no means almost flat. By contrast,
there are Cayley graphs of finite groups at the opposite extreme permitted by (1.3), which is to say $\lambda_1$ is
comparable to the Cheeger constant itself, not its square. This happens, for example, for the lamplighter
group based on $(\mathbb{Z}/n\mathbb{Z})^d$ for any given $d \ge 3$. For all $d \ge 1$ the Cheeger constant is comparable to $1/\gamma$
(almost extremal sets are obtained by setting one fixed light off), but $\lambda_1$ is comparable to $1/\gamma^2$ if $d = 1$,
to $1/(\gamma \log \gamma)$ if $d = 2$, and to $1/\gamma$ if $d \ge 3$ [40].

Remark 5.4 (Expansion and number of generators). Almost flat groups are not expanders, since their $\lambda_1$
values are comparable to $1/\gamma^2$ by Corollary 1.6. In fact, in order to construct a Cayley graph of a finite
group $G$ with a nilpotent subgroup of bounded index, rank and nilpotency class with a positive lower
bound on $\lambda_1$, one would need at least a constant times $\log |G|$ generators. This follows immediately from
Lemma 4.13 and the corresponding result for abelian groups, which is proved in [38, Propositions 3.9 &
3.10]. Note that this is extremal in some sense, because according to Alon and Roichman [U Theorem
1], given $\varepsilon > 0$, every finite group $G$ has an $\varepsilon$-expanding generating set $S$ of size at most $C_\varepsilon \log |G|$.

We now pass to the proof of Corollaries 1.7 and 1.8.

Proof of Corollary [Z3 We fix a finite symmetric generating set for T and consider the Cayley graphs of 
the quotients T/T^ with respect to this generating set. The second inequality in Lemma l5 .1 1 implies that 
diam(r/r„) > ^jr/T^j^. Being finitely generated, T has only finitely many subgroups of index at most 
AI for any given M > 1, and so it follows that T/T^ is |-almost flat as soon as n is large enough. It then 
follows from Theorem 1 1.31 that T/r„ has a quotient of diameter at least ^ diam(r /Tn) that has an abelian 
subgroup whose index is bounded independently of n. This implies that there is a finite-index subgroup 
H <T that maps onto abelian groups of unbounded order, which in particular means that H/[H,H] is 
infinite. Since H/[H, H] is also finitely generated and abelian, it admits a surjective homomorphism onto 
Z. □ 

The proof of Corollary 1.8, the manifold version of Corollary 1.7, is a simple application of the Brooks-Burger
transfer principle, for which we refer the reader to the paper [39] and the appendix to the lecture
notes [5], as well as the original sources. This states that given a compact connected Riemannian
manifold $M$, and a finite symmetric generating set $S$ of its fundamental group, there are positive constants
$c_1, c_2$ such that for every finite-degree Riemannian cover $M' \to M$, we have

$$c_1 \lambda_1(M') \le \lambda_1(\pi_1(M)/\pi_1(M'), S) \le c_2 \lambda_1(M') \qquad (5.4)$$

where $\lambda_1(M')$ denotes the first non-zero eigenvalue of the Laplace-Beltrami operator on $M'$ and
$\lambda_1(\pi_1(M)/\pi_1(M'), S)$ that of the corresponding Schreier graph. Similarly,

$$c_1 h(M') \le h(\pi_1(M)/\pi_1(M'), S) \le c_2 h(M'). \qquad (5.5)$$

It is worth emphasising that the constants $c_1, c_2$ here depend only on the base manifold $M$ and choice
of $S$, but not on $M'$.

Proof of Corollary 1.8. This is just the combination of (5.5) and Corollary 1.7. □

6. Moderate growth and mixing times 

In this section we prove Corollary 1.9, which shows that almost flat groups and groups with moderate
growth coincide. We then discuss mixing-time estimates for random walks on almost flat groups, and
prove Corollary 1.10 and Theorem 1.11.

We start with a crude, yet general, bound on the diameter of a Cayley graph (which is meaningful only
when $S$ is large, since otherwise it is worse than the trivial bound $\mathrm{diam}_S(G) \le |G|$).

Lemma 6.1. Let $G$ be a finite group and $S$ a symmetric generating set. Then

$$\mathrm{diam}_S(G) \le 2(|G|/|S|)^2.$$

Proof. Let $m$ be the least integer such that $|S^{2^{m+1}}| < \frac{3}{2}|S^{2^m}|$, noting that this implies that $|G|/|S| \ge (\frac{3}{2})^m$.
Freiman's lemma [19] shows that the bound $|S^{2^{m+1}}| < \frac{3}{2}|S^{2^m}|$ implies that $S^{2^m}$ is contained in a coset of
a subgroup $H \le G$ satisfying $|H| < \frac{3}{2}|S^{2^m}|$. Since $1 \in S$, we in fact have $S^{2^m} \subset H$, and since $S$ generates
$G$ this implies that $H = G$, and in particular that $|S^{2^m}| > \frac{2}{3}|G|$. This in turn implies that $S^{2^{m+1}} = G$,
and hence that $\mathrm{diam}_S(G) \le 2 \cdot 2^m$. The result follows since $(\frac{3}{2})^2 \ge 2$. □
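
The steps of this argument can be traced numerically. The following sketch, restricted to cyclic groups $\mathbb{Z}/n\mathbb{Z}$ for simplicity, finds the least $m$ with $|S^{2^{m+1}}| < \frac{3}{2}|S^{2^m}|$ and then checks the intermediate claims and the final bound $\mathrm{diam}_S(G) \le 2(|G|/|S|)^2$. The function names and the choice of test groups are our own assumptions.

```python
def ball_sizes(n, steps):
    """Return (S, [|S^0|, ..., |S^gamma|]) for S = {0} ∪ {±s : s in steps} in Z/n."""
    gens = {0} | {s % n for s in steps} | {-s % n for s in steps}
    ball, sizes = {0}, [1]
    while len(ball) < n:
        nxt = {(x + s) % n for x in ball for s in gens}
        if len(nxt) == len(ball):
            raise ValueError("the given steps do not generate Z/n")
        ball = nxt
        sizes.append(len(ball))
    return gens, sizes


def trace_lemma_6_1(n, steps):
    """Follow the proof of the diameter bound and report which claims hold."""
    gens, sizes = ball_sizes(n, steps)
    diam = len(sizes) - 1

    def size(r):
        # |S^r|, which stays equal to n once r reaches the diameter
        return sizes[min(r, diam)]

    # Least m with |S^(2^(m+1))| < (3/2) |S^(2^m)|, in exact integer arithmetic.
    m = 0
    while 2 * size(2 ** (m + 1)) >= 3 * size(2 ** m):
        m += 1
    return {
        "m": m,
        "index_bound": n * 2 ** m >= len(gens) * 3 ** m,   # |G|/|S| >= (3/2)^m
        "large_ball": 3 * size(2 ** m) > 2 * n,            # |S^(2^m)| > (2/3)|G|
        "diam_le_2_pow": diam <= 2 ** (m + 1),             # diam <= 2 * 2^m
        "lemma": diam * len(gens) ** 2 <= 2 * n ** 2,      # diam <= 2 (|G|/|S|)^2
    }


for n, steps in [(7, (1,)), (30, (1,)), (31, (1, 5))]:
    print(n, steps, trace_lemma_6_1(n, steps))
```

For $S = \{0, \pm 1\}$ the bound is far from sharp, in line with the remark that it is only meaningful when $S$ is large.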


Proof of Corollary 1.9. In the first direction, suppose that $(G, S)$ has $(A, d)$-moderate growth, which is
to say that (1.6) holds for all $n \ge 1$. The case $n = 1$ implies that $\gamma \ge A^{-1/d}(|G|/|S|)^{1/d}$. However,
$\gamma \le 2(|G|/|S|)^2$ by Lemma 6.1, so $(G, S)$ is $\frac{1}{2d}$-almost flat whenever $\gamma \ge 2A^4$.

In the converse direction, suppose that {G,S) is e-almost flat, let K = 5e, and let no = no{K) be the 
integer given by Theorem ll.il If 7 < -Rq then the almost flat assumption implies that for every n < 7 we 
have 

|S”I > |S| > (;|)‘ ■ |5| > (^)‘ ■ |C|. 

and so G has {n^Q , e“^)-moderate growth. We may therefore assume that 7 > Uq. Lemma 14.91 therefore 
implies that there is some n satisfying no < n < 7 ^/^ such that < K\S^\, and so Theorem 11.11 implies 
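that $|S^{5n}| \le \theta(K)^5|S^n|$ for all $n \ge \gamma^{1/2}$. Given $n \ge \gamma^{1/2}$, therefore, writing $r$ for the smallest integer such
that $5^r n \ge \gamma$ we have $|G| \le \theta(K)^{5r}|S^n|$ and $\theta(K)^{5r} \le \theta(K)^{5+5\log_5(\gamma/n)} = \theta(K)^5(\gamma/n)^{5\log_5\theta(K)}$, and
so

\[ |S^n| \ge \frac{1}{A}\left(\frac{n}{\gamma}\right)^d |G| \tag{6.1} \]

holds with $A = \theta(K)^5$ and $d = 5\log_5\theta(K)$. When $n < \gamma^{1/2}$, on the other hand, the almost flat assumption
implies that $|S|/|G| \ge 1/\gamma^{1/\varepsilon}$, and so

\[ \frac{|S^n|}{|G|} \ge \frac{|S|}{|G|} \ge \frac{1}{\gamma^{1/\varepsilon}} \ge \left(\frac{n}{\gamma}\right)^{2/\varepsilon}. \]

Thus (6.1) holds with $d = 2\varepsilon^{-1}$ and $A = 1$.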


□ 
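The two exponent manipulations used at the end of the preceding proof can be spelled out as follows. This is only a worked restatement of the steps above, with $\theta = \theta(K)$ and $r$ the smallest integer such that $5^r n \ge \gamma$, as in the proof.

```latex
\begin{align*}
5^{r-1} n < \gamma
  &\implies r < 1 + \log_5(\gamma/n)
   \implies \theta^{5r} \le \theta^{5 + 5\log_5(\gamma/n)}, \\
\theta^{5\log_5(\gamma/n)}
  &= 5^{\,5\log_5\theta \cdot \log_5(\gamma/n)}
   = (\gamma/n)^{5\log_5\theta}, \\
n < \gamma^{1/2}
  &\implies \frac{n}{\gamma} < \gamma^{-1/2}
   \implies \left(\frac{n}{\gamma}\right)^{2/\varepsilon} < \gamma^{-1/\varepsilon}.
\end{align*}
```

The first two lines give the moderate-growth exponent $d = 5\log_5\theta(K)$ in the range $n \ge \gamma^{1/2}$, and the third gives $d = 2\varepsilon^{-1}$ in the range $n < \gamma^{1/2}$.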


The main result of the work [15] of Diaconis and Saloff-Coste shows that groups with moderate growth
have mixing time quadratic in the diameter. Inspired by this, we spend the rest of this section discussing
various other sufficient conditions on the Cayley graph of a finite group for its mixing time to be quadratic
in the diameter.

We should emphasise that the constants in the results of Diaconis and Saloff-Coste are effective in the
moderate-growth parameters of the group in question; the ineffectiveness of the constants appearing in
the results of this section arises only from our reliance on Theorem 2.3.


By definition, the L'P mixing time Tp is the first integer n such that ~ l^cWp < where /as is 

the uniform probability measure on the generating set $S$ and $u_G$ the uniform probability measure on $G$.
We view them as functions on $G$ taking non-negative values. Of course, the choice of constant factor in this definition is purely
a matter of convention. It is customary also to define the relaxation time $T_{\mathrm{rel}}$ as the inverse spectral gap,
or more specifically $T_{\mathrm{rel}} = 1/(1 - \beta_S)$, where $\beta_S$ is the norm of the operator $1 - \Delta/|S|$. For simplicity, we
will assume that $\beta_S = 1 - \lambda_1/|S|$, which amounts to assuming that $\Delta$ has no eigenvalue too close to $2|S|$.
This is the case if $\lambda_1 \le 2$ and $1 \in S$.
The mixing times $T_p$ dominate the relaxation time $T_{\mathrm{rel}}$ and are non-decreasing in $p$, and in fact there
are essentially only two mixing times, $T_1$ and $T_\infty$, because $T_p$ is comparable to $T_\infty$ if $p > 1$. The example
of the lamplighter group with cyclic base shows that Too really can be much larger than Ti [251 So]. 
Writing $\gamma = \mathrm{diam}_S(G)$, we have $\frac{1}{2}\mathrm{diam}_S(G) \le T_p \le O_{|S|}(\gamma^3)$, and the same lamplighter example shows
that the upper bound can indeed be reached for Too [251140j . It is not known, however, whether Ti is 
always $O_{|S|}(\gamma^2)$. We summarise these facts and some others in the following proposition, which is well
known and encapsulates the basic properties of mixing times on Cayley graphs. 

Proposition 6.2 (Basic facts on mixing times). Let $G$ be a finite group with symmetric generating
set $S$ containing 1 and denote by $\gamma$ the diameter of the Cayley graph. Assume that $\lambda_1 \le 2$, and set
$\beta_S := 1 - \lambda_1/|S|$. Then, for $p \in [1,\infty]$ and $n \in \mathbb{N}$, the following conditions hold.

(1) The quantity $\|\mu_S^{(n)} - u_G\|_p$ is non-increasing in $n$.

(2) The quantity $\|\mu_S^{(n)} - u_G\|_p/\|u_G\|_p$ is non-decreasing in $p$, and hence so is $T_p$.

(3) We have $\beta_S^n \le \|\mu_S^{(n)} - u_G\|_p/\|u_G\|_p$.

(4) We have

\[ \frac{\|\mu_S^{(2n)} - u_G\|_p}{\|u_G\|_p} \le \left(\frac{\|\mu_S^{(n)} - u_G\|_2}{\|u_G\|_2}\right)^2. \]

(5) We have $\|\mu_S^{(n)} - u_G\|_2 \le \beta_S^n$.

(6) For every $p > 1$ there exists $C(p) > 0$ such that $T_\infty \le C(p)T_p$. For $p \ge 2$ we may take $C(p) = 2$.

(7) We have $T_p \ge \frac{1}{2}\gamma$ for all $p$, and $T_\infty \ge \gamma$.

(8) We have $T_2 \le 8|S|\gamma^2\log|G| \le 8|S|\log|S|\,\gamma^3$.

(9) We have $T_{\mathrm{rel}} \le \min\{T_1, 8|S|\gamma^2\}$.

We include the straightforward proof for the reader’s convenience. 

Proof. Condition (1) holds with $\|\cdot\|_p$ replaced by any $G$-invariant norm $\|\cdot\|$, since the triangle inequality
implies that $\|\mu_S * f\| \le \|f\|$ for every $f : G \to \mathbb{R}$. To prove (2), assume that $p < p' < \infty$ and note
that the function $u \mapsto u^{p'/p}$ is convex, and hence that we may apply Jensen's inequality in the form
$u_G(f^{p'/p}) \ge (u_G(f))^{p'/p}$ to the function $f(x) = |\mu_S^{(n)}(x) - u_G(x)|^p$, from which (2) follows (the case $p' = \infty$
being a trivial calculation).

For the bound (3), let $q$ be such that $1/p + 1/q = 1$, so that by duality we have

\[ \|\mu_S^{(n)} - u_G\|_p = \sup_{\|f\|_q \le 1} |\mu_S^{(n)}(f) - u_G(f)|. \tag{6.2} \]

Let $f$ be an eigenfunction of the Laplacian with eigenvalue $\lambda_1$, translated so that $f(1) = \|f\|_\infty$ and scaled
so that $\|f\|_q = 1$. Note that $u_G(f) = 0$, and also that $\mu_S * f = \beta_S f$, hence that $\mu_S^{(n)}(f) = \beta_S^n f(1) =
\beta_S^n\|f\|_\infty$. It therefore follows from (6.2) that $\|\mu_S^{(n)} - u_G\|_p \ge \beta_S^n\|f\|_\infty \ge \beta_S^n\|f\|_q|G|^{-1/q} = \beta_S^n\|u_G\|_p$, and
so (3) follows.

It is enough to prove (4) for $p = \infty$, but this follows from the Cauchy-Schwarz inequality applied to
the obvious identity $(\mu_S^{(2n)} - u_G)(x) = (\mu_S - u_G)^{(2n)}(x) = \sum_{y \in G}(\mu_S - u_G)^{(n)}(xy^{-1})(\mu_S - u_G)^{(n)}(y)$, which
is valid for every $x \in G$, and yields $\|\mu_S^{(2n)} - u_G\|_\infty \le \|\mu_S^{(n)} - u_G\|_2^2$, and hence (4) after observing that
$\|u_G\|_\infty = \|u_G\|_2^2$.
To see (5), write $P_{\mu_S}$ for the operator of left convolution by $\mu_S$, and note that (6.2) gives

\[ \|\mu_S^{(n)} - u_G\|_2 = \sup_{\|f\|_2 \le 1,\; u_G(f) = 0} |P_{\mu_S}^n f(1)| \le \|P_{\mu_S}^n\|_2 = \beta_S^n. \tag{6.3} \]

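It is trivial that $T_\infty \ge \gamma$, so to prove (7) it suffices to check that $T_1 \ge \frac{1}{2}\gamma$. To see this, note that the
ball of radius $T_1$ contains at least half of the elements of $G$, since otherwise $\|\mu_S^{(T_1)} - u_G\|_1$ would exceed the threshold in the definition of $T_1$, and so the
ball of radius $2T_1$ is all of $G$.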

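The fact that (6) holds with $C(p) = 2$ for $p \ge 2$ follows from (2) and (4). When $p \in (1,2)$, applying
Young's inequality to the identity $(\mu_S^{(2n)} - u_G) = (\mu_S^{(n)} - u_G) * (\mu_S^{(n)} - u_G)$ implies that $\|\mu_S^{(2n)} - u_G\|_{p'} \le \|\mu_S^{(n)} - u_G\|_p^2$, where $1/p' = 2/p - 1$. Since $\|u_G\|_{p'} = \|u_G\|_p^2$, it follows
that $T_{p'} \le 2T_p$. Iterating this a finite number of times if necessary, we see that $T_2 \le C'(p)T_p$
for some constant $C'(p) > 0$, and (6) follows.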

The first inequality of (8) follows immediately from (5) and the lower bound on $\lambda_1$ from (1.3), while the
second inequality follows from the fact that $|G| = |S^\gamma| \le |S|^\gamma$. Finally, to prove (9), note that (1.3) shows
that $T_{\mathrm{rel}} \le 8|S|\gamma^2$, while the fact that $T_{\mathrm{rel}} \le T_1$ follows from (3) and the assumption that $\lambda_1 \le 2$. □
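Several items of Proposition 6.2 can be checked numerically on a small example. The sketch below is a minimal illustration, assuming $G = \mathbb{Z}/N\mathbb{Z}$ with $S = \{0, \pm 1\}$, counting-measure $\ell^p$ norms, and the explicit value $\lambda_1 = 2 - 2\cos(2\pi/N)$ for the cycle; the helper names (`walk_step`, `lp_norm`, `distances`, `beta_cycle`) are hypothetical and not taken from the paper.

```python
import math


def walk_step(mu, S):
    """One step of the walk: returns the convolution mu_S * mu on Z/NZ, N = len(mu)."""
    N = len(mu)
    return [sum(mu[(x - s) % N] for s in S) / len(S) for x in range(N)]


def lp_norm(f, p):
    """Counting-measure l^p norm; p may be math.inf."""
    if p == math.inf:
        return max(abs(v) for v in f)
    return sum(abs(v) ** p for v in f) ** (1.0 / p)


def distances(N, S, p, steps):
    """List of ||mu_S^(n) - u_G||_p for n = 1..steps on G = Z/NZ."""
    mu = [1.0] + [0.0] * (N - 1)  # mu_S^(0) is the point mass at the identity
    out = []
    for _ in range(steps):
        mu = walk_step(mu, S)
        out.append(lp_norm([m - 1.0 / N for m in mu], p))
    return out


def beta_cycle(N):
    """beta_S = 1 - lambda_1/|S| for S = {0, +1, -1}, with lambda_1 = 2 - 2cos(2*pi/N)."""
    return 1.0 - (2.0 - 2.0 * math.cos(2.0 * math.pi / N)) / 3.0
```

With `d = distances(12, (0, 1, -1), 2, 40)` one can compare `d[n - 1]` with `beta_cycle(12) ** n` to observe item (5), and dividing by $\|u_G\|_p = N^{1/p - 1}$ gives the normalised quantities appearing in items (2) and (3).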

Proof of Corollary 1.10. The lower bound follows immediately from (3) and the bound (1.5).
For the upper bound, we need to improve Proposition 6.2 (5) into $\|\mu_S^{(n)} - u_G\|_2 \le B\beta_S^n\|u_G\|_2$.
Getting this extra factor $\|u_G\|_2$ is the heart of the matter, and it was established by Diaconis and Saloff-Coste
in [15, Theorem 3.2] for groups of moderate growth with an explicit dependence of the constant $B$
in terms of the moderate growth parameters $A$ and $d$. Corollary 1.9 implies that $G$ has such parameters
depending only on $\varepsilon$, and so the desired bound follows. (The reader should note that, while [15, Theorem
3.2] is stated for $\|\mu_S^{(n)} - u_G\|_1$, it also holds for $\|\mu_S^{(n)} - u_G\|_2/\|u_G\|_2$ by Remark 2 on page 6 of [15].) □

We now discuss and prove Theorem 1.11. Almost flat groups or groups of moderate growth are doubling
at some scale $\le \gamma^\delta$ for any fixed $\delta > 0$ by Lemma 4.9, and it is legitimate to ask whether this
property alone is enough to give a mixing time quadratic in $\gamma$. The following reformulation of Theorem
1.11 shows that if a Cayley graph is doubling at one scale $\le \gamma^{2/3}$ then this is indeed enough to guarantee
quadratic mixing time, thus extending the validity of the Diaconis-Saloff-Coste theorem [15].

Theorem 6.3 (Doubling and quadratic mixing time). Let $G$ be a finite group with symmetric generating
set $S$ and denote by $\gamma$ the diameter of the Cayley graph. Let $K \ge 1$ be a parameter. Then there is a
constant $C_K > 0$ depending on $K$ only such that if $|S^{2n+1}| \le K|S^n|$ for some $n \ge C_K$ then

• if $n \le \gamma/3$, then $T_{\mathrm{rel}}$ is quadratic in the sense that $T_{\mathrm{rel}} \ge \gamma^2/C_K$;

• if $n \le \gamma^{2/3}$, then $T_p$ is also quadratic for each $p \in [1,\infty]$, in the sense that $T_p = O_{K,|S|}(\gamma^2)$.

In proving Theorem 6.3 we make use of the following variant of Lemma 5.1.

Lemma 6.4. Let $\mathcal{G}$ be the Cayley graph of a finite group $G$ with symmetric generating set $S$. Let $H \le G$
be a normal subgroup and let $\gamma_H$ be the diameter of $H$ in the graph distance. Assume that $f$ is a function
on $G$ such that $\sum_{x \in gH} f(x) = 0$ for every $g \in G$. Then

\[ \frac{\|\nabla f\|_2^2}{\|f\|_2^2} \ge \frac{1}{\gamma_H^2}. \]

Proof. Pick $y \in H$ and connect it to the identity by a geodesic path $1 = y_0, y_1, \ldots, y_n = y$, where $y_i =
s_i y_{i-1}$ for some $s_i \in S$, and $n \le \gamma_H$. Observe that for each $i$ we have $\sum_{g \in G}|f(y_{i-1}g) - f(y_i g)|^2 \le \|\nabla f\|_2^2$.
The Cauchy-Schwarz inequality in the form $(\sum_{i=1}^n |a_i|)^2 \le n\sum_{i=1}^n |a_i|^2$ then gives

\[ \sum_{g \in G}|f(g) - f(yg)|^2 \le n\sum_{g \in G}\sum_{i=1}^{n}|f(y_{i-1}g) - f(y_i g)|^2 \le \gamma_H^2\|\nabla f\|_2^2. \tag{6.4} \]

On the other hand, for $g \in G$ we have $\sum_{y \in H}|f(g) - f(yg)|^2 = \|f - f(g)\|^2_{\ell^2(gH)}$ since $gH = Hg$, and so

\[ \sum_{y \in H}|f(g) - f(yg)|^2 \ge \|f\|^2_{\ell^2(gH)} \tag{6.5} \]

since $f$ has mean zero on $gH$. Summing (6.4) over $y \in H$ and (6.5) over $g \in G$ shows that $|H|\|f\|_2^2 \le
\sum_{g \in G}\sum_{y \in H}|f(g) - f(yg)|^2 \le |H|\gamma_H^2\|\nabla f\|_2^2$, and so the lemma is proved. □
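The inequality of Lemma 6.4 can be tested numerically on a cyclic group. The following is a minimal sketch under stated assumptions: $G = \mathbb{Z}/N\mathbb{Z}$, $H = k\mathbb{Z}/N\mathbb{Z}$ with $k$ a proper divisor of $N$, and the normalisation $\|\nabla f\|_2^2 = \sum_{g \in G}\sum_{s \in S}|f(g) - f(sg)|^2$, which is our assumption here since the definition of $\nabla$ is given elsewhere in the paper. All function names are hypothetical.

```python
import random


def word_lengths(N, S):
    """Graph distance from 0 to each reachable element of Z/NZ in the Cayley graph of S."""
    dist = {0: 0}
    frontier = [0]
    while frontier:
        nxt = []
        for x in frontier:
            for s in S:
                y = (x + s) % N
                if y not in dist:
                    dist[y] = dist[x] + 1
                    nxt.append(y)
        frontier = nxt
    return dist


def center_on_cosets(f, N, k):
    """Subtract from f its mean over each coset of H = kZ/NZ (k must divide N)."""
    out = list(f)
    for r in range(k):
        coset = range(r, N, k)
        mean = sum(f[x] for x in coset) / len(coset)
        for x in coset:
            out[x] = f[x] - mean
    return out


def grad_norm_sq(f, N, S):
    """ASSUMED normalisation: sum over g in G and s in S of |f(g) - f(g + s)|^2."""
    return sum((f[g] - f[(g + s) % N]) ** 2 for g in range(N) for s in S)


def poincare_check(N, k, S, seed=0):
    """Return (||grad f||^2 / ||f||^2, 1 / gamma_H^2) for a random coset-centred f."""
    rng = random.Random(seed)
    f = center_on_cosets([rng.uniform(-1, 1) for _ in range(N)], N, k)
    dist = word_lengths(N, S)
    gamma_H = max(dist[h] for h in range(0, N, k))  # diameter of H in the graph distance
    ratio = grad_norm_sq(f, N, S) / sum(v * v for v in f)
    return ratio, 1.0 / gamma_H ** 2
```

Calling `poincare_check(12, 4, (0, 1, -1))` returns the Rayleigh-type quotient of a random function with zero mean on each coset of $H = \{0, 4, 8\}$ together with the lower bound $1/\gamma_H^2$ predicted by the lemma.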

Proof of Theorem 6.3. The first item follows directly from the combination of Theorem 1.1 and Lemma

K2\ 

For the second item, we first note that if $C_K$ is chosen large enough then Proposition 4.24 implies
that there is a normal subgroup $H \le G$ contained in $S^{8n}$ such that $G/H$ has a nilpotent subgroup
whose index, rank and nilpotency class are bounded in terms of $K$ only. Note that the diameter $\gamma_{G/H}$ of
the quotient Cayley graph is at least $\gamma/2$ (unless $\gamma$ is bounded), and hence that, by Proposition 4.7 and
Lemma 4.2, this quotient Cayley graph is $\varepsilon$-almost flat for some $\varepsilon$ depending only on $|S|$ and $K$. Corollary
1.10 therefore applies, and we conclude (denoting by $P := P_{\mu_S}$ the operator of left convolution by $\mu_S$ and
recalling (6.3)) that if $u$ is a zero mean function in $\ell^2(G/H)$, then $\|P^n u\|_2 \le Be^{-n/(C\gamma^2)}\|u\|_2\|u_{G/H}\|_2$ for
all $n \ge 1$ and for some positive constants $B, C$ depending only on $|S|$ and $K$.

Now we decompose the space $\ell^2(G)$ as a direct sum $\ell^2(G) = \ell^2(G/H) \oplus \ell^2(G/H)^\perp$, where $\ell^2(G/H)$ is
this time viewed as the subspace of $H$-invariant functions and $\ell^2(G/H)^\perp$ its orthogonal, the subspace of
functions with zero mean on each coset of $H$. Both spaces are $G$-invariant, and given a function $f$ on $G$
we may write $f$ uniquely as $f = u + v$, where $u \in \ell^2(G/H)$ and $v \in \ell^2(G/H)^\perp$, and $\|f\|_2^2 = \|u\|_2^2 + \|v\|_2^2$.
In particular, this implies that $\|P^n f\|_2^2 = \|P^n u\|_2^2 + \|P^n v\|_2^2$. However, the norm of $P$ in restriction
to $\ell^2(G/H)^\perp$ is given by $1 - \lambda/|S| \le e^{-\lambda/|S|}$, where $\lambda$ is the smallest eigenvalue of $\Delta$ in restriction to
$\ell^2(G/H)^\perp$. Lemma 6.4 shows that $\lambda \ge 1/\gamma_H^2$, where $\gamma_H$ is the diameter of $H$ in $G$, and so we have

\[ \|P^n v\|_2 \le e^{-n/(|S|\gamma_H^2)}\|v\|_2. \]
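Now note that $\mu_S^{(n)} - u_G = P^n(f)$, where $f = 1_e - u_G$. Decompose $f$ as $f = \bar f + (f - \bar f)$, where $\bar f$
is the $H$-invariant function $\bar f(g) := \frac{1}{|H|}\sum_{x \in gH} f(x)$, so that $f - \bar f$ has zero mean on $H$-cosets, and note
therefore that

\[ \|\mu_S^{(n)} - u_G\|_2^2 = \|P^n(\bar f)\|_2^2 + \|P^n(f - \bar f)\|_2^2 \le B^2e^{-2n/(C\gamma^2)}\|\bar f\|_2^2\|u_{G/H}\|_2^2 + e^{-2n/(|S|\gamma_H^2)}\|f - \bar f\|_2^2. \]

Finally, observe that $\bar f = \frac{1}{|H|}1_H - u_G$, that $\|\bar f\|_2^2 = 1/|H| - 1/|G| \le 1/|H| = \|u_G\|_2^2/\|u_{G/H}\|_2^2$, and that
$\|f - \bar f\|_2^2 = 1 - 1/|H| \le 1$. Hence

\[ \|\mu_S^{(n)} - u_G\|_2^2 \le B^2e^{-2n/(C\gamma^2)}\|u_G\|_2^2 + e^{-2n/(|S|\gamma_H^2)}. \]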

Now note that $\gamma_H^2\log|G| = \gamma_H^2\log(|G|/|H|) + \gamma_H^2\log|H|$. But $G/H$ is $\varepsilon$-almost flat, so $|G/H| \le |S|\gamma^{1/\varepsilon}$,
and clearly $|H| \le |S|^{\gamma_H}$, so $\log|H| \le \gamma_H\log|S|$. On the other hand $H$ is contained in the ball of radius
$8\gamma^{2/3}$, so $\gamma_H \le 8\gamma^{2/3}$. It follows that

\[ \gamma_H^2\log|G| \ll_{\varepsilon,|S|} \gamma^{4/3}\log\gamma + \gamma^2 \ll \gamma^2, \]

and therefore that $\|\mu_S^{(n)} - u_G\|_2/\|u_G\|_2$ falls below the threshold in the definition of $T_2$ as soon as $n \gg_{K,|S|} \gamma^2$. This gives the result for $T_2$, and hence
for all $T_p$ by Proposition 6.2. □

We finish by presenting a simple example showing that the exponent $\frac{2}{3}$ in the second item of Theorem
1.11 is sharp. We thank T. Zheng for suggesting it.

Example 6.5 (Sharpness of the $\frac{2}{3}$ in Theorem 1.11). Let $L_M$ be the lamplighter group $L_M := \mathbb{Z}/M\mathbb{Z} \ltimes B$,
where $B$ is the direct product of $M$ copies of $\mathbb{Z}/2\mathbb{Z}$ that are cyclically permuted by the action of $\mathbb{Z}/M\mathbb{Z}$,
and let $G := L_M \times \mathbb{Z}/N\mathbb{Z}$. Let $S := S_1 \times S_2$, where $S_1$ is the standard generating set of $L_M$ (consisting
of the identity, a move of one space to the left or right, or a switch of the current lamp), and $S_2$ is the
standard generating set $\{0, \pm 1\}$ of $\mathbb{Z}/N\mathbb{Z}$. Clearly the diameter $\gamma$ of $G$ is comparable to $\max\{M, N\}$.
On the other hand, the uniform mixing time $T_\infty(G)$ is at least $\max\{T_\infty(L_M), T_\infty(\mathbb{Z}/N\mathbb{Z})\}$, which is
comparable to $\max\{M^3, N^2\}$ (see [25]). Let $\alpha \in (\frac{2}{3}, 1)$ and set $M := N^\alpha$. Then $\gamma \simeq N$, and $T_\infty(G)$ grows
faster than $N^2 \simeq \gamma^2$. However, $M \simeq \gamma^\alpha$, and we have uniform doubling at all scales $\ge M$.
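The exponent count behind this example can be written out explicitly. The following is only a worked restatement, assuming the estimates $T_\infty(L_M) \asymp M^3$ and $T_\infty(\mathbb{Z}/N\mathbb{Z}) \asymp N^2$ quoted above from [25].

```latex
\begin{align*}
M = N^{\alpha},\ \alpha < 1
  &\implies \gamma \asymp \max\{M, N\} = N, \\
T_\infty(G)
  &\gtrsim \max\{M^3, N^2\} = \max\{N^{3\alpha}, N^2\} = N^{3\alpha}
  \quad \text{when } \alpha > \tfrac{2}{3}, \\
\frac{T_\infty(G)}{\gamma^2}
  &\gtrsim N^{3\alpha - 2} \longrightarrow \infty
  \quad \text{as } N \to \infty .
\end{align*}
```

At $\alpha = \frac{2}{3}$ the two terms $N^{3\alpha}$ and $N^2$ balance, which is why doubling at a scale comparable to $\gamma^\alpha$ with $\alpha > \frac{2}{3}$ no longer forces $T_\infty = O(\gamma^2)$.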